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Abstract 

We discuss contact geometry naturally related with optimal control problems (and Pontryagin 
Maximum Principle). We explore and expand the observations of Ohsawa HOhslSL providing simple 
and elegant characterizations of normal and abnormal sub-Riemannian extremals. 
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1 Introduction 

A contact interpretation of the Pontryagin Maximum Principle. In a recent paper Ohsawa BOhslSIl 
observed that for normal solutions of the optimal control problem on a manifold Q, the Hamiltonian evo¬ 
lution of the covector in T* {Q x M) considered in the Pontryagin Maximum Principle (PMP, in short), 
projects to a well-defined contact evolution in the projectivization P(T*((5 x M)). Here Q x M is the ex¬ 
tended configuration space (consisting of both the configurations Q and the costs M) and P(T* (Q x M)) is 
equipped with a natural contact structure. Moreover, Ohsawa observed that the maximized Hamiltonian 
of the PMP is precisely the generating function of this contact evolution. 

The above result was our basic inspiration to undertake this study. Our goal was to understand, from 
a geometric viewpoint, the role and origins of the above-mentioned contact structure in the PMP and to 
study possible limitations of the contact approach (does it work alike for abnormal solutions, etc.). 
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As a result we prove Theoreni |5.6[ a version of the PMP, in which the standard Hamiltonian evolution 
of a covector curve At in T*(Q x M) along an optimal solution q{t) G Q x M is substituted by a 
contact evolution of a curve of hyperplanes 'Ht in T((5 x M) along this solution. (Note that the space 
of all hyperplanes in T((5 x M) is actually the manifold of contact elements of Q x M and can be 
naturally identified with P(T*((5 x M)).) It is worth mentioning that this result is valid regardless of 
the fact whether the solution is normal or abnormal and, moreover, the contact evolution is given by a 
natural contact lift of the extremal vector field (regarded as a time-dependent vector field on Q x M). 
Finally, using the well-known relation between contact vector fields and smooth functions we were able 
to interpret the Pontryagin maximized Hamiltonian as a generating function of the contact evolution of 
-Ht. 

It seems to us that, apart from the very recent paper of Ohsawa HOhslSl . the relation between optimal 
control and contact geometry has not been explored in the literature. This fact is not difficult to explain 
as the PMP in its Hamiltonian formulation has been very successful and as symplectic geometry is much 
better developed and understood than contact geometry. In our opinion, the contact approach to the PMP 
seems to be a promising direction of studies for at least two reasons. First of all it allows for a unified 
treatment of normal and abnormal solutions and, secondly, it seems to be closer to the actual geometric 
meaning of the PMP (we shall justify this statement below). 


About the proof. The justification of Theorem 5.6 is rather trivial. In fact, it is just a matter of in¬ 
terpretation of the classical proof of the PMP IIPMBG62II (see also HLewOhll and OLib 1211 1. Recall that 
geometrically the PMP says that at each point of the optimal trajectory q{t), the cone Kt C Tqp)(QxM) 
approximating the reachable set can be separated, by a hyperplane Tit C Tqp^{Q x M), from the direc¬ 
tion of the decreasing cost (cf. Figure [^. Thus in its original sense the PMP describes the evolution of 
a family of hyperplanes 'Ht (i■e■^ a curve in the manifold of contact elements of Q x M, identified with 
P(T*(Q X M))) along the optimal solution. This evolution is induced by the flow of the optimal control 
on <5 X M. From this perspective the only ingredient one needs to prove Theorem 5.6 is to show that 


this flow induces a contact evolution (with respect to the natural contact structure) on P(T*(Q x 
It is worth mentioning that the covector curve At G T* (Q x M) from the standard formulation of the 
PMP is nothing else than just an alternative description of the above-mentioned curve of hyperplanes, 
i.e., 'Ht = ker At for each time t. Obviously, there is an ambiguity in choosing such a At, which is 
defined up to a rescaling. 


Applications. From the above perspective it is obvious that the description of the necessary conditions 
for optimality of the PMP in terms of Tit’s (the contact approach) is closer to the actual geometric 
meaning of the PMP as it contains the direct information about the separating hyperplanes. On the 
contrary, in the Hamiltonian approach this information is translated into the language of covectors (not 
to forget the non-uniqueness of the choice of A^). 

We illustrate the contact approach to the PMP by discussing its applications to the sub-Riemannian 
(SR, in short) geodesic problem in Section]^ Recall that a SR geodesic problem on a manifold Q is an 
optimal control problem in which the controls parametrize trajectories tangent to a smooth distribution 
'D C TQ and the cost of a trajectory is its length calculated via a given positively defined bilinear 
form (7 : 2? X P — )■ M (the SR metric). Actually, due to the Cauchy-Schwartz inequality, the trajectories 
minimizing the length are exactly those that minimize the kinetic energy and are parametrized by the arc- 
length. In such a setting, using some elementary geometric considerations, we were able to relate V and 
g with the separating hyperplanes Ht (Lemma [6.4| ). In consequence, still using elementary arguments, 
the following two results about SR extremals were derived: 

• Theorem [6]^ completely characterizes abnormal SR extremals. It states that an absolutely contin¬ 
uous curve q{t) £ Q tangent to V is an abnormal extremal if and only if the minimal distribution 
along q{t) which contains 'Dgp^ and is invariant along q{t) under the flow of the extremal vector 
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field is of rank smaller than dim Q. As a special case (for smooth vector fields) we obtain, in 
Corollary |6.8[ the following result: if the distribution spanned by the iterated Lie brackets of a 
given 2?-valued vector field X G T{T>) with all possible P-valued vector fields, i.e., 

(adi(Z)lZGr(P), A: = 0,1,2,...) 

is of constant rank smaller than dim Q, then the integral curves of X are abnormal SR extremals. 


Theorem 6.15 in a similar manner (yet under an additional assumptions that the controls are nor¬ 
malized with respect to the SR metric g) provides a complete characterization of normal SR ex¬ 
tremals. It states that an absolutely continuous curve q{t) G Q, tangent to V, is a normal extremal 
if and only if it is of class with an absolutely continuous derivative and if the minimal distribu¬ 
tion along q{t) which contains these elements of T’q(t) that are ^r-orthogonal to q{t) and is invariant 
along q{t) under the flow of the extremal vector field does not contain the direction tangent to q{t) 
at any point. Again in the smooth case we conclude, in Corollary |6.16[ that if for a given normal¬ 
ized vector field X G T{T>) the distribution spanned by the iterated Lie brackets of X with all 
possible 2?-valued vector fields g'-othogonal to X, i.e.. 


(ad^(Z)lZGr(P), 5(^,X)=0, A: = 0,1,2,...) 


is of constant rank and does not contain X at any point of q{t), then the integral curves of X are 
normal SR extremals. 


The first of the above results seems not to be present in the literature. Of course a characterization 
of abnormal extremals in Hamiltonian terms is well-known (see, e.g., IISus98ll ) and as such has been 
widely used. In many particular cases (see, e.g., IIJKP091 1 it allowed to obtain criteria to derive abnormal 
extremals similar to the smooth version of Theorem 6.7 The second of the above results appears in its 
smooth version in IILS95II for rank-2 distributions, and in the general version in IIAOP971I in a formulation 
equivalent to ours. For these reasons we do not claim to be the first to obtain the above results (although 
our simple formulations using the language of flows of the optimal control seem to be new). What we 
believe, however, to be an added value is the simplicity of derivation of these results in our approach. 
Indeed, our proofs use only basic geometric tools, actually nothing more sophisticated than the definition 
of the flow, the derivation of the tangent space to a paraboloid, and the Gram-Schmidt algorithm. 

It should be stressed that the language of flows used throughout is much more effective, and in fact 
simpler, than the language of Lie brackets usually applied in the study of SR extremals. Indeed, the 
assertions of Theorems 6.7 and 6.15 are valid for non-smooth, i.e., absolutely continuous curves and 
bounded measurable controls, do not require any regularity assumptions (contrary to the characterization 
in terms of Lie brackets) and work for single trajectories (not necessary families of trajectories). 

As an illustration of the above results we give a few examples. In particular, in Examples |6.5| and [6. 17| 
we were able to provide a surprisingly easy derivation of the Riemannian geodesic equation (obtaining 
the equation Vy 7 = 0 from the standard Hamiltonian approach is explained in IIABB12[ ISus98l l. In 
Examples 6.10[ |6.14[ and 6.18 we re-discover some results of IIES95II and |Zhi95l concerning rank-2 
distributions. 


Organization of the paper. We begin our considerations by a technical introduction in Section 
Our main goal in this part is to introduce, in a rigorous way, natural differential geometric tools (Eie 
brackets, flows of time-dependent vector fields, distributions, etc.) in the non-smooth and time-dependent 
setting suitable for control theory (in general, we consider controls which are only locally bounded and 
measurable). Most of the results presented in this section are natural generalizations of the results well- 
known in the smooth case. They are essentially based on the local existence and uniqueness of solutions 
of ODE in the sense of Caratheodory (Theorem | A. 2| ). To avoid being too technical, we moved various 
parts of the exposition of this section (including some proofs and definitions) to Appendix [A| 
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In Section we briefly recall basic definitions and constructions of contact geometry. In particular, 
we show an elegant construction of contact vector fields (infinitesimal symmetries of contact distribu¬ 
tions) in terms of equivalence classes of vector fields modulo the contact distribution. This construction 
is more fundamental than the standard one in terms of generating functions (which requires a particular 
choice of a contact form). It seems to us that so far it has not been present in the literature. 

In Section]^ we discuss in detail a natural contact structure on the projectivization of the cotangent 
bundle P(T*M). In particular, we construct a natural contact transformation F{F) of P(T*M) induced 
by a diffeomorphism F of M. Later we study an infinitesimal counterpart of this construction, i.e., a 
natural lift of a vector field X on M to a contact vector field Cx on P(T*M). 

In Section we introduce the optimal control problem for a control system on a manifold Q and 
formulate the PMP in its standard version (Theorem |5.1| ). Later we sketch the standard proof of the PMP 
introducing the cones Kt and the separating hyperplanes FCt- A proper interpretation of these objects, 
together with our previous considerations about the geometry of P(T*M) from Section|^ allows us to 
conclude Theorems |5.6| and [5T^ which are the contact and the covariant versions of the PMP, respectively. 

Finally, in the last Section we concentrate our attention on the geometry of the cones Kt and 
hyperplanes Fit for the Riemannian and sub-Riemannian geodesic problems. The main results of that 
section, which characterize normal and abnormal SR extremals, were already discussed in detail in the 
paragraph “Applications” above. 

2 Technical preliminaries 

As indicated in the Introduction, in this paper we shall apply the language of differential geometry to 
optimal control theory. This requires some attention as differential geometry uses tools such as vector 
fields, their flows, distributions and Lie brackets which are a priori smooth, while in control theory it is 
natural to work with objects of lower regularity. The main technical difficulty is a rigorous introduction of 
the notion of the flow of a time-dependent vector field (TDVF, in short) with the time-dependence being, 
in general, only measurable. A solution of this problem, provided within the framework of chronological 
calculus, can be found in IIAS04I . The recent monograph llJL14l with a detailed discussion of regularity 
aspects is another exhaustive source of information about this topic. 

Despite the existence of the above-mentioned excellent references, we decided to present our own 
explication of the notion of the flow of a TDVF. The reasons for that decision are three-fold. First of all 
this makes our paper self-contained. Secondly, we actually do not need the full machinery of IIAS041I 
or IIJL14II . so we can present a simplified approach. Finally, for future purposes we need to concentrate 
our attention on some specific aspects (such as the transport of a distribution along an integral curve of 
a TDVF and the relation of this transport with the Lie bracket) which are present in neither IIAS041 . nor 
iJLT4l . Our goal in this section is to give a minimal yet sufficient introduction to the above-mentioned 
concepts. We move technical details and rigorous proofs to Appendix [A| 

Time-dependent vector fields and their flows. Let M be a smooth manifold. By a time-dependent 
vector field on M {TDVF, in short) we shall understand a family of vector fields Xt G 3i{M) parametrized 
by a real parameter t (the time). Every such a field defines the following non-autonomous OD^on M 

(2.1) x{t) = Xt{x{t)) . 


below guarantees that solutions of p.l| ) (in the sense of Caratheodory) locally exist, are unique and 
are absolutely continuous with bounded derivatives {ACB, in short, see Appendix with respect to 

'Sometimes it is convenient to identify a TDVF Xt on M with the vector field X{x, t) = Xt{x) -\- dt on M x R. Within 
this identification equation ilD is an M-projection of the autonomous ODE (x, i) = X{x, t) defined on M x R. 


A technical assumption that the map {x,t) i— )• Xfix) is Caratheodory 
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the time t. For this reason from now on we shall restrict our attention to TDVF’s Xt satisfying the 
above assumption. We will call them Caratheodory TDVF's. In a very similar context the notion of a 
Caratheodory section was introduced in the recent monograph IIJL141I . Actually, in the language of the 
latter work our notion of a Caratheodory TDVF would be called a locally bounded Caratheodory vector 
field of class C^. 

A solution of ( |2.1| ) with the initial condition x{to) = xq will be denoted by x{t; to,xo) and called an 
integral curve of Xt. When speaking about families of such solutions with different initial conditions it 
will be convenient to introduce (local) maps Auq : M ^ M defined by Atto{xo) '■= x{t; to, xq). 

Lemma 2.1. Let Xt G X{M) be a Caratheodory TDVF on M. Then 

• For t close enough to to the maps Auq : M ^ M are well-defined local dijfeomorphisms. 

• Moreover, they satisfy the following properties 

(2.2) At^to — and Atr^AT-tg) — > 


whenever both sides are defined. 


Since Xt is Caratheodory, it satisfies locally the assumptions of Theorem A. 2 


of Lemma 2.1 follows directly from the latter result. Properties 
fact that 1 1 — )■ Auq (®o) is an integral curve of Xt. 


are merely 


Now the justification 
a consequence of the 


Definition 2.2. The family of local diffeomorphisms Atr : M ^ M described in the above lemma will 
be called the time-dependent flow of Xt (TD flow, in short). 


Clearly Auq is a natural time-dependent analog of the notion of the flow of a vector field. This 
justifies the name “TD flow”. It is worth noticing that, alike for the standard notion of the flow, there is 
a natural correspondence between TD flows and Caratheodory TDVF’s. 

Lemma 2.3. Let Atr '■ M M be a family of local diffeomorphisms satisfying ( |2.2| ) and such that 
for each choice of xo G M and to £ the map t i—)• Auffxo) is ACB. Then Atr is a TD flow of some 
Caratheodory TDVF Xt. 

The natural candidate for such a TDVF is simply Xt (^) - ^\r- _^AT-t{x). The remaining details are 
left to the reader. 


Distributions along integral curves of TDVF’s. In this paragraph we shall introduce basic defini¬ 
tions and basic properties related with distributions defined along a single ACB integral curve x{t) = 
x(f; to, Xo) (with t G [to, fi]) of a Caratheodory TDVF Xt. In particular, for future purposes it will be 
crucial to understand the behavior of such distributions under the TD flow Atr of Xt. 

Definition 2.4. Let x{t) = x{t; to, xq) with t G [to, fi] be an integral curve of a Caratheodory TDVF 
Xt. A distribution B along x{t) is a family of linear subspaces C attached at each point 

of the considered curve. In general, the dimension of B^p) may vary from point to point. 

By an ACB section ofB we will understand a vector field Z along x{t) such that Z{x(t)) G for 
every f G [to, fi] and that the map t i—)• Z{x{t)) is ACB. The space of such sections will be denoted by 
^ACb{S). a distribution B along x{t) shall be called charming if it is point-wise spanned by a finite set 
of elements of Tacb{S). 

We shall say that B is Atr-invariant (or respected by a TDflow Atr) along x{t) if 

^x{t) = TAt.r(^a;(r)) 

for every t,T e Equivalently, B^p^ = TAttgiB^^^.p^'^) for every t G In particular, if B is 

respected by Atr along x{t) then it is of constant rank along x{t). This follows from the fact that each 
map Atr is a local diffeomorphism. 
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Let us remark that the idea behind the notion of a charming distribution is to provide a natural substi¬ 
tution of the notion of smoothness in the situation where a distribution is considered along a non-smooth 
curve. Observe namely that a restriction of a smooth vector field on M to an ACB curve x{t; to,xo) is a 
priori only an ACB vector field along x{t]to,xo). 

Proposition 2.5. Charming distributions appear naturally in the following two situations: 

• A restriction of a locally finitely generated smooth distribution on M to an ACB curve x{t) = 
x{t; to, xo) A charming. 


• Let Afr be the TD flow of a Caratheodory TDVF Xt and let B be a distribution along an integral 
curve x{t) = x{t; to, xq) of Xt. Then ifB is Atr-invariant along x{t), it is also charming. 


The juslificafion of fhe above resulf is slraighlforward. Regarding fhe firsl sifuafion if was already 
observed fhaf a resfricfion of a smoofh vecfor field fo an ACB curve is an ACB vecfor field. In fhe 
second sifuafion, fhe disfribufion B is spanned by vecfor fi elds TAtt^iX'-) wifh i = 1,... ,k, where 
..., X^} is any basis of Bxq. By fhe resulfs of Lemma . 


A.3 


fhese fields are ACB. 


Given a disfribufion B along x{t) we can always exfend if fo fhe smallesf (wifh respecf fo inclusion) 
disfribufion along x{t) confaining B and respecfed by fhe TD flow Atr along x{t). This consfrucfion will 
play a crucial role in geomefric characferizafion of normal and abnormal SR exfremals in Section 


Proposition 2.6. Let x{t) = x{t; to,xo) with t G [fg, fi] be a trajectory of a TDVF Xt. Let Atr be the 
TD flow of Xt and let B be a distribution along x{t). Then 


A,iB)x(t) ■= vect]R{TAt.r(A) | X e Bx(t), h < t < h} 

is the smallest distribution along x{t) which contains B and is respected by the TDflow Atr along x{t). 

Obviously, any distribution Atr-invariant along x{t) and containing B^p) must contain A,(B)x(t)- 
The fact that the latter is indeed Atr-invariant along x(t) follows easily from property (|2.2|). 


Lie brackets and distributions. Constructing distributions Atr-invariant along x(t) introduced in 
Proposition |2.6[ although conceptually very simple, is not very useful from the practical point of view, 
as it requires calculating the TD flow Atr. This difficulty can be overcome by passing to an infinitesimal 
description in terms of the Lie brackets, however, for a price of loosing some generality. In this paragraph 
we shall discuss this and some related problems in detail. 

Definition 2.7. Let Xt be a Caratheodory TDVF and x(t) = x(t;to,xo) its integral curve. Given any 
smooth vector field Z G X{M) we define the Lie bracket of Xt and Z along x{t) by the formula 


[■^t, ^]x{t) ) 


i.e., we calculate the standard Lie bracket [Xt, Z] “freezing” the time t and then evaluate it at the point 
x{t), thus obtaining a well-defined field of vectors along x{t) (the regularity of the map 1 1 —)■ [Xt, Z]xp) 
is a separate issue that we shall discuss later). 

For future purposes we would like to extend Definition |2.7| to be able to calculate the bracket [Xt, Z]xp) 
also for fields Z of lower regularity. That can be done, but at a price that the bracket [Xt, Z]xp) would 
be defined only for almost every (a.e., in short) t G [to, ti]. The details of this construction are provided 
below. 

As a motivation recall that for M = given two smooth vector fields X, Z G X(M"') (understood 
as maps X, Z : MF M”) their Lie bracket at a point xo equals 




A 

dt 




A 

ds 


X(z(s)) , 

s=0 
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where 1 1— x{t) is the integral curve of X emerging from xq at time 0 (in particular, ^ iQx(f) = X(xo)) 
and s I—z(s) is the integral curve of Z emerging from xq at time 0 (in particular, ^ |q^;(s) = Z{xo)). 
The above formula, actually, allows to define [X, Z]^^, on any smooth manifold M, simply by taking it 
as the definition of the Lie bracket [X, Z]xfj in a particular local coordinate system on M. It is an easy 
exercise to show that [X, Z]xq defined in such a way is a frue geomefric objecf (i.e., if does nof depend 
on fhe parficular choice of a local charf). Nofe fhaf in order fo calculafe [X, Z]xq we need only fo know 
X along s !-)> z(s) and Z along t x{t). 

Observe fhaf fo use direcfly fhe above compufafional definition fo calculafe fhe Lie brackef [Xt, Zjxp'j 
along x{t) = x{t] to, xo) we should use a separafe integral curve of fhe field Xt (wifh “frozen” lime) for 
every t G [fo, fi], i-e.. 


(2.3) 


[Xt, Z]x(^t) 


dr 


Z{xt{T)) 


d_ 

ds 


Xt{z{s,t)) , 

s=0 


where for each t G [fo, fi] the map r ^ xt{T) is the integral curve of the field Xt emerging from the 
point x{t) at time t = t and s i— z{s, t) = z{s; 0, x(f)) is the integral curve of Z emerging from x(f) 
at s = 0, i.e., z{t,0) = x{t) = x(f;fo,xo) and ^|Q 2 (f, s) = Z{x{t)). Observe now that by definition 
xt{t) = Xt{x{t)) = x(f) and thus p.3| ) holds for xt(r) = x{t). What is more, p.3| ) is well-defined 
at a given time f G [to, fi] also for any vector field Z on M (not necessarily smooth) such that the map 
T I—)• Z{x{t)) is differentiable at r = f. This observation justifies the following statement 


Proposition 2.8. Assuming that t i— ?■ Z{x{t)) is an ACB map and that Xt is a Caratheodory TDVF, the 
Lie bracket [Xt, Z]xp) is defined by formula ( |2.3| ) almost everywhere along x(f). In fact, it is well-defined 
at all regular points oft^ Z{x{t)). Moreover, t i— [Xt, Z]x{t) is a measurable and locally bounded 
map. 

The Lie bracket [Xt, Z]xp), is completely determined by the values of Z along x{f) and by the values 
of Xt in a neighborhood of x{t). 

In other words, formula p.3| ) is an extension of Definition |2.7| which allows to calculate the Lie 
bracket [Xt, Z]^^') at almost every point of a given integral curve x{t) of Xt, for vector fields Z defined 
only along x(f) and such that f i—)■ Z{x{t)) is ACB. The latter generalization is necessary in control 
theory, since, as f i—x(f) is in general ACB only, even if Z is a smooth vector field, we cannot expect 
the map f i—)■ Z{x{t)) to be of regularity higher than ACB. 

The above construction of the Lie bracket [Xt, Z]x(p allows to introduce the following natural con¬ 
struction. 


Definition 2.9. Let Xt be a Caratheodory TDVF, x{t) = x(f;fo,xo) (with f G [fo,fi]) its integral 
curve and let .6 be a distribution along x(f). By [Xt, B] we shall understand the distribution along x(f) 
generated by the Lie brackets of Xt and all ACB sections of B: 

[Xt,B]x(t) := vectK{[Xt,y]3,(i) | Y G Tacb{B)} , 

where we consider [Xt, y]x{t) ^t all points where it makes sense, i.e., at which the bracket [Xt, Y]x(t) is 
well-defined. 

A charming distribution B along x(f) will be called Xt-invariant along x(f) if 
[Xt, B]x(t) C Bx(t) for almost every f G [fo, fi]. 

Note that neither [Xt, B] nor B -|- [Xt, B] need be charming distributions along x(f) even if so was 
B as, in general, there is no guarantee that these distributions will be spanned by ACB sections (we can 
loose regularity when calculating the Lie bracket). 

The following result explains the relation between the Atr- and A*-invariance of distributions along 
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Theorem 2.10. Let B be a distribution along x{t) = x(t; tg, xg) (with t G [to, ti]j, an integral curve of 
a Caratheodory TDVF Xt, and let Atr be the TDflow ofXp The following conditions are equivalent 

(a) B is respected by the TDflow Atr of Xt along x{t). 


(b) B is a charming distribution Xfinvariant and of constant rank along x{t). 

The proof is given in Appendix]^ Note that the equivalence between Xf and A^T-invariance is valid 
only if the considered distribution B along x{t) satisfies regularity conditions: it has to be charming and 
of constant rank along x(t). 

Given a charming distribution B along x{t), it is clear in the light of the above result, that A,(B)x{t)^ 
the smallest distribution Atr-invariant along x{t) and containing B, should be closed under the operation 
[Xt, •]. Thus, in the smooth case, it is natural to try to construct A,(B) in the following way. 


Lemma 2.11. Let X be a C°°-smooth vector field and let B a C°°-smooth distribution on M. Assume 
that along an integral curve x{t) = x{t; tg,xg) of X (with t G [to, ti]j, the distribution spanned by the 
iterated Lie brackets of X with all possible B-valued vector fields, i.e., 

(2.4) ad^(S),p):=(adi(Z),(i)|ZGr(S), A: = 0,1, 2,... ) 


is of constant rank along x(t). Then ad^ ('B)a;(t) t/ie smallest distribution along x(t) containing B^p^ 
and respected by At, the flow of X, i.e., adx {B)x(t) = A,{B)x(t)- 


Proof. The justification of the above result is quite simple. By construction, ad^ {^)x{t) is the smallest 
distribution along x{t) containing B^(t) closed under the operation adx = [X, •]. It is clear that 
ad^(.B) is spanned by a finite number of smooth vector fields of the form ad\(Z), where Z G r(.B), 
and thus it is charming. Since it is also of constant rank along x{t) we can use Theorem 2.10 (for a 
time-independent vector field X) to prove that ad^(B)^p^ is invariant along x{t) under the flow At. 
We conclude that A,(B)x(t) C adx {B)x(t)- On the other hand, since A,(B)x(t) is A^-invariant along 
x{t), again by Theorem 2.10 it must be closed with respect to the operation [X, ■]. In particular, it must 
contain the smallest distribution along x(t) containing B^p) and closed under the operation [X, ■]. Thus 
A,(B)^p-^ D adxiB)„.py This ends the proof. □ 


Remark 2.12. Let us remark that the construction provided by \2.A) would be, in general, not possible 
in all non-smooth cases. The basic reason is that the Lie bracket defined by p.3[ ) is of regularity lower 
than the initial vector fields, i.e., [Xt, Z] may not be ACB along x{t) even if so were Xt and Z. Thus by 
adding the iterated Lie brackets to the initial distribution B, we may loose the property that it is charming 
(cf. also a remark following Definition [Z^ which is essential for Theorem 2.10 to hold. 

Also the constant rank condition is important, as otherwise the correspondence between Xt- and Atr- 
invariance provided by Theorem [2 .1 0| does not hold. If (^1 is not of constant rank along x(t) we may 
only say that adx {B)x[t) G A,{B)j.p^ (see also Remark 


6.91. 


It is worth noticing that this situation resembles the well-known results of Sussmann IISus73ll concern¬ 
ing the integrability of distributions: being closed under the Lie bracket is not sufficient for integrability, 
as the invariance with respect to the flows of distribution-valued vector fields is also needed. After adding 
an extra assumption that the rank of the distribution is constant, the latter condition can be relaxed. 


By the results of Proposition [2^ the property that a distribution B is 2ft-invariant along x{t) depends 
not only on B and the values of a Caratheodory TDVF Xt along x{t), but also on the values of Xt 
in a neighborhood of that integral curve. It turns out, however, that in a class of natural situations the 
knowledge of Xt along x{t) suffices for checking the Xj-invariance. 


Lemma 2.13. Let D be a smooth distribution of constant rank on M, Xt a Caratheodory D-valued 
TDVF and x(t) = x(f; tg, xg) (with t G [tg,ti]) an integral curve ofXt. Let B be a charming distribution 
along x{t), such that D^p-^ C Bxp)for every t. Then the property of B being Xt-invariant along x{t) 
depends only on the values of Xt along x(t). 
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The proof is given in Appendix [A| 


3 The basics of contact geometry 

Contact manifolds and contact transformations. In this section we shall recall basic facts from con¬ 
tact geometry. A contact structure on a manifold Ad is a smooth co-rank one distribution C C TAf 
satisfying certain maximum non-degeneracy condition. In order to formalize that condition we introduce 
the following geometric construction. From now on we shall assume that the pair (Ad,C) consists of a 
smooth manifold Ad and a smooth co-rank one distribution C on Ad. Sometimes it will be convenient to 
treat C as a vector subbundle of TAd. 

Given (Ad, C) one can define the bundle normal to C in TAf as the quotient 

NC :=TAf/C . 

Note that NC has a natural structure of a line bundle (since C is of co-rank one) over Ad. We shall denote 
this bundle by r : NC —)■ Ad. 

Let now X and Y be two C-valued vector fields on Af. If is easy fo check fhaf fhe class of fheir Lie 
brackef [A, Y] in NC is fensorial wifh respecf fo bofh X and Y. Thaf is, for any pair of smoofh funcfions 

l(f>-X,^-Y] = (^^- [X, Y] mod C . 

If follows fhaf fhe assignmenf 

(X,Y) ^/3(X,Y) := IX,Y] mod C , 

defines a NC-valued 2-form /3 : h?C —)• NC . Now we are ready fo sfafe fhe following 

Definition 3.1. A pair {M ., C) consisfing of a smoofh manifold Af and a smoofh co-rank one disfribufion 
C C TAd is called a contact manifold if fhe associated NC-valued 2-form /f is non-degenerate, i.e., if 
/3(A, •) = 0 implies X = 0. 

Sometimes we call C a contact structure or a contact distribution on Af. 

Observe fhaf C is necessarily of even-rank (A is odd-dimensional). This follows from a simple facl 
from linear algebra fhaf every skew-symmefric 2-form on an odd-dimensional space has a non-frivial 
kernel. 

Definition 3.2. Let (A,C) be a contact manifold. A diffeomorphism F : A —)• A which preserves 
the contact distribution, i.e., TF{Cp) = Cp(^p'j for every p G A, where TF stands for the tangent map of 
F, is called a contact transformation. By a contact vector field (CVF in short) on A (or an infinitesimal 
symmetry of (A,C)) we shall understand a smooth vector field X G j£(A) preserving the contact 
distribution C, i.e., 

[A,C] CC. 

Equivalently, A is a CVF if and only if its (local) flow At consists of contact transformations (cf. Theo¬ 
rem |2T^. 

It is worth mentioning that the above relation between contact vector fields and flows consisting of 
contact transformations can be generalized to the context of TDVF’s and TD flows (cf. Section |^. We 
will need this generalized relation in Section [Rafter introducing control systems. 

Proposition 3.3. Let Xt be a Caratheodory TDVF on a contact manifold (A, C) and let Atr be the TD 
flow ofXt- Then Xt is a contact vector field for every t G M (i.e., [Xt, C] C C) if and only if the TDflow 
Atr consists of contact transformations. 

The proof follows directly from Theorem|2.10|by taking B = C (which is charming - cf. Proposition 

1231). 
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Characterization of CVF’s. It turns out that there is a one-to-one eorrespondenee between CVF’s on 
Ai and sections of the normal bundle NC. 

Lemma 3.4. Let X G X(Ad) be any representative of the class [X] G r(NC). By h{X) G r(C) we shall 
denote the unique C-valued vector field satisfying 

[h{X),Y] = [Y,X] mode 

for every Y G r(C). The assignment [X] i—)■ C^x] '■= X + h{X) is well-defined and establishes a 
one-to-one correspondence between sections o/NC and CVF’s on Ai. 

Remark 3.5. Throughout we will denote by X G X{A4) vector fields on At, by L G r(C) vector fields 
valued in C and by C (also wifh varianfs, like Cyx] or Cf) confacf veefor fields. 

Proof Lef us begin wifh infroducing fhe following geomefric consfruefion. Wifh every smoofh veefor 
field X G X{Ai) one can associate a NC-valued 1-form ax ■ C —)■ NC defined by fhe formula 

ax{Y) ■.= [Y,X] mode, 

where X is a C-valued veefor field. The correefness of fhis definifion follows from fhe facl fhaf for every 
C-valued veefor field Y and for any funclion f G C°^{Ai) we have 

[0 • y, X] = (/. • [y, X] mod c. 

Using fhe one-form ax we can prove fhaf [X] i—)■ Cyx] is a well-defined map, i.e., fhaf fhe value of 
Cjx] does nol depend on fhe choice of fhe represenfafive X. Indeed, we can inferpref h{X) as fhe unique 
(note fhaf /3 is non-degenerate) solufion of fhe equafion axf ) = P{h{X),-). Now observe fhaf if X and 
X' are fwo differenf represenfafives of [X], fhen Y := X' — X is a C-valued veefor field on A4. Thus 
we have ay (•) = —/3{Y, •) and hence, using fhe obvious linearify of ax wifh respeef fo X, we gel 

/3(fi(X'), •) = axff = ax+Y{-) = axf) + ay(-) = fi{h{X), •) - (3{Y, •) = /)(/i(X) - X, •) • 

We conclude fhaf h{X') = h{X) — Y and, consequenlly, 

C[x'] =x' + h{x') = X + y + hix) - y = X + h{x) = Cyx] ■ 


Secondly, observe fhaf C\^x] is a CVF. Indeed, by consfruefion, given any C-valued veefor field Y we 
have 

[C^x ], X] = [X + h{X), y] = 0 mod C , 
i.e., [C\^x ], y] is a C-valued veefor field. 

Finally, we need fo check fhaf every CVF is of fhe form Cyxy By consfruefion fhe class of Cyx] in 
NC is equal fo fhe class of X in NC (fhese fwo vector fields differ by a C-valued veefor field h{X)). 
Thus fhe classes of CVF’s of fhe form Cyx] realize every possible seefion of NC. Now if is enough to 
observe fhaf fhe NC-class uniquely determines a CVF. Indeed, if C and C are fwo CVF’s belonging fo 
fhe same class in NC, fhen fheir difference X — X' is a C-valued CVF, i.e., [C — C\ X] = 0 mod C for 
any C-valued vector field X. Thai is, j5{C — C, •) = 0 and from fhe non-degeneracy of we conclude 
fhaf C — C = 0. This ends fhe proof. □ 

Remark 3.6. If is nalural to call a vector field X G X{A4) (or ils NC-class [X]) a generator of fhe CVF 
Cyxy Observe fhaf fhe NC-class of fhe CVF Cyx] is the same as the class of its generator X (they differ 
by a C-valued vector field h{X)). 
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In the literature, see e.g., IILM87I . a contact distribution C on Ad is often presented as the kernel 
of a certain 1 -form io G A^(Af) (such an io is then called a contact form). In the language of oj, the 
maximum non-degeneracy condition can be expressed as the non-degeneracy of the 2 -form dw on C. 
The latter is equivalent to the condition that a; A (dw)^”, where u, = | rankC, is a volume form on Af 
(i.e., uj A (dw)^"' 7 ^ 0). 

Also CVF’s have an elegant characterization in terms of contact forms. One can show that CVF’s 
are in one-to-one correspondence with smooth functions on Ad. Choose a contact form a; such that 
C = kerw, then this correspondence is given by an assignment f 1 —)• C^, where is the unique 
vector field on M such that (jc{C^) = f and {C^jduj)\c = — d flc- A function f is usually called the 
generating function of the corresponding CVF associated with the contact form w. Notice that given 
a contact vector field C = Cff, and a confacf form w one can recover fhe generafing function simply by 
evaluating ui on C, i.e., f = u}{C). 

If is inferesfing fo relafe fhe consfrucfion f wifh fhe consfrucfion [X] 1 —)• C^x] given above. 

Namely, fhe choice of a confacf form uj allows fo infroduce a vecfor field R G X(Af) (known as fhe 
Reeb vector field) defined uniquely by fhe conditions uj{R) = 1 and Rjdoj = 0. Since R is nol 
confained in C = ker cj, ifs class [R] esfablishes a basis of fhe normal bundle NC. Consequenfly, we can 
idenfify smoofh functions on Af wifh secfions of NC, via 4> [(t^R]- Now if is nol difficull fo prove lhal 

Cij) = and conversely fhaf C^x] = C<j) for f Tho delails are lefl fo fhe reader. 

Nofe, however, fhaf fhe descripfion of fhe confacf dislribufion C in ferms of a confacf form cj is, in 
general, non-canonical (as every rescaling of cj by a nowhere-vanishing funcfion gives fhe same kernel 
C) and valid only locally (as fhere clearly exisf confacf dislribulions which cannol be globally presenfed 
as kernels of single 1-forms). For fhis reason fhe description of a confacf manifold (Ad, C) in ferms of 
C and relafed objecfs (e.g., NC, (3) is more fundamenlal and offen concepfually simpler (for example, in 
fhe description of CVF’s) fhan fhe one in ferms of cj. Nol fo mention fhaf, for insfance, fhe consfrucfion 
of fhe CVF does depend on fhe particular choice of cj, whereas fhe consfrucfion of is universal. 

Remark 3.7. In fhe conlexl of CVF’s on a confacf manifold (A4,C) if is worlh noticing an eleganl 
correspondence belween CVF’s on Af and a cerlain class of conlrol-affine syslems on Ad. A control- 
affine system on a manifold Ad is usually underslood as a differenlial equafion of fhe form 

m 

(3.1) x = f{x) + '^v3gi{x) . 

i=l 

Here f,gi G X(Ad) are smoofh vecfor fields (/ is usually called a drift) and (u^,..., n"*)^ G are 
conlrol parameters. Trajectories of fhe confrol system ( |3.1| ) are integral curves x{t) G A{x{t)) of fhe 
affine dislribufion 

A = f + g CTM, 

where Q = (^gi,..., g^)- Nofe fhaf fhe dislribufion Q, fhe linear pari of A, is well-defined, whereas 
fhe drifl is defined only relafive fo Q, i.e., f G = f' G, whenever f — f ^ G■ Irr fhe lighl of our 
considerafions abouf CVF’s if is easy to prove fhe following facf. 

Proposition 3.8. Let (Ad,C) be a contact manifold. There is a one-to-one correspondence between 
CVF’s on Ad and controTaffme systems (equivalently, affine distributions) on Ad of the form A = X -\- 
C C TAf, where X G r(Af ). 

Indeed, fo every CVF C, we attach fhe affine dislribufion (conlrol-affine syslem) A = C ->r C. Con¬ 
versely, given an affine dislribufion (conlrol-affine syslem) A = X -|- C, fhere exisls a unique CVF 
C G r(A), namely C = C'[x]^ such fhaf A = C* -|- C = Cyx] + C. In ofher words, on every confacf man¬ 
ifold (Af,C), fhere are as many CVF’s C’s as conlrol-affine systems A = X -|- C’s, fhe correspondence 
being eslablished by fhe map A = X -|- C 1 — 
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4 Contact geometry of P(T*M) 

In this section we shall describe the natural contact structure on P(T*M) and its relation with the canon¬ 
ical symplectic structure on T*M (see, e.g., IIArn89ll or IILM871I 1. Later it will turn out that this structure 
for M = Q X M plays the crucial role in optimal control theory. 

The canonical contact structure on P(T*M). Let us denote the cotangent bundle of a manifold M by 
ttm '■ T*M M. The projectivized cotangent bundle F{T*M) is defined as the space of equivalence 
classes of non-zero covectors from T*M with [9] = [O'] if -KMiG) = and 9 = a ■ O' fox some 

scalar a G M \ {0}. Clearly, P(T*M) is naturally a smooth manifold and also a fiber bundle over M 
wifh fhe projecfion tt : P(T*M) —)■ M given by vr : [0] i—)■ txm{9)- The fiber of vr over p G M is simply 
fhe projecfive space P(T*M). If is worfh noficing fhaf P(T*M) can be also understood as fhe space of 
hyperplanes in TM (a manifold of contact elements), where we can identify each poinf [0] G P(T*M) 
wifh fhe hyperplane Tf [g] := ker 9 C 

Lemma 4.1. P(T*M) carries a canonical contact structure given by 
(4.1) C[e] = {y G T[ 0 ]P(T*M) | T 7 r(y) G ^9] = ker0} , 

for [9] G P(T*M). 

In ofher words, consisfs of all vectors in Tj 5 i]P(T*M) which projecf, under Tvr, to fhe hyperplane 
= ker0, see Fig. We shall refer to fhis confacf sfrucfure as fhe canonical contact structure on 
F{T*M). 



The facf fhaf ( |4.1| ) defines a confacf sfrucfure is well-known in fhe liferafure. The proof is given, for 
insfance, in Appendix 4 of fhe book of Arnold IIAm89L where fhe reasoning is based on fhe properfies of 
fhe Liouville 1-form Qm on fhe cofangenf manifold T*M. For convenience of our fulure considerafions 
in Secfion|^we shall, however, presenf a separafe proof quife similar fo fhe one of Arnold. 

Proof Lef R G X(M) be any smoofh vector field. We shall now consfrucf a confacf form ujr on an 
open subsef of P(T*M) for which R will be fhe Reeb vecfor field (cf. our considerafions following 
Remark!^. The soIUr = {[0] | 9{R) / 0} C P(T*M) is an open subsef and Ur projecfs under vr fo 
fhe open subsef {p \ R{p) 7^ 0} C M. In fhe language of hyperplanes, Ur consisfs of all hyperplanes 
in TM which are fransversal fo fhe given field R. Clearly fhe collecfion of subsefs Ur for all possible 
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vector fields R forms an open covering of P(T*M). The open subset Ur C F{T*M) can be naturally 
embedded as a co-dimension one submanifold in T*M by means of the map 

T*M , 

which assigns to the class [0] G Ur the unique representative 9 such that 6{R) = 1. Clearly, the natural 
projection vr : P(T*M) —)■ M, restricted to Ur, is simply the composition of ir : Ur —)• T*M and the 
cotangent projection ttm '■ T*M —)■ M, i.e., 

(4.2) tt\^^=7:m oiR ■ 

Denote by Qm the Liouville form on T*M, i.e., 'Fttm{Y)'^, for Y G T^T^M. We 

claim that the pullback cor := {IrYFIm is a contact form on Ur C P(T*M). Indeed, by definition, 
Y G T[ 5 i]P(T*M) belongs to Cjej if and only if T 7 r(y) G = ker 0. In other words, Y G C[ 0 ] if and 
only if (0, T 7 r(y)) = 0. By ( |4.2| ) for 9 = z_r([ 0]) we have 

(4.3) (0,T7r(y)) = {9,YT,M{YiR{Y))) = DMUTfR(y)) = = WR|[,j(y) . 

We conclude that C\uji= ker ojr. 

To finish the proof it is enough to check that ojr satisfies the maximum non-degeneracy condition. 
This can be easily seen by introducing local coordinates (x°, ,..., x^) on M in which R = (recall 

that R is non-vanishing on t:{Ur), so such a choice is locally possible). Let {x^,pi) be the induced 
coordinates on T*M. It is clear that in these coordinates the image iR^UR) C T*M is characterized 
by equation po = 1 and thus the Liouville form FIm = restricted to iRiplR) is simply 

-h X]r=iKdx*. Obviously, the pullback functions x* := {IrYx^ with i = 0,... ,n and pj := 
withy = 1,..., n form a coordinate system inIn these coordinates the form = {IrY^m 
simply reads as d x° -|- Pi ^ It is a matter of a simple calculation to check that such a one-form 
satisfies the maximum non-degeneracy condition. We conclude that lcr is, indeed, a contact form on Ur 
for the canonical contact structure on P(T*M). □ 

Contact transformations of P(T*M) induced by diffeomorphisms. In this paragraph we will define 
contact transformations of P(T*M) which are natural lifts of diffeomorphisms of the base M. 

Definition 4.2. Let F : M —)> M be a diffeomorphism. Its tangent map TF : TM ^ TM induces a 
natural transformation P(F) : P(T*M) —)• P(T*M) of the space of hyperplanes in TM, i.e., given a 
hyperplane R C TpM, we define the hyperplane P(F)(F) C T^(p)M to be simply the image TF('F). 
The map P(F) shall be called the contact lift of F. 

Observe that if Tf = ker 0, then TF{'H) = ker((F“^)*(0)). In other words, P(F) is the projection 
of (F“^)* : T*M —)• T*M to P(T*M) (note that (F“^)* is linear on fibers of T*M, so this projection 
is well-defined) 

nmo]) = [{F-^Yo] ■ 

It is worth noticing that P(F) respects the fiber bundle structure of tt : P(T*M) —)• M: 

(4.4) P(T*M)- — —^P(T*M) 

TT TT 

M - - -. 


We claim that 


Lemma 4.3. P(F) is a contact transformation with respect to the canonical contact structure on P(T*M). 
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Proof. Let Y be an element of T[g]P(T*M) projecting to T 7 r(y) =: Y_ under Tvr. By diagram ( |4.4| ), the 
tangent map TP(F) sends Y to an element of lying over TF(y). 

Now if Y belongs to the contact distribution C[ 0 ], i.e., see (ED, if y G ker 9, then 

((F-i)*6»,TF(y)) = (6l,TF-^TF(y)) = {e,Y_) = Q . 

Consequently, TF(y) G ker(y“^)*0, and thus TP(y)(y) belongs to which ends the proof. 

□ 

Let us remark that an alternative way to prove the above result is to show that maps the 

contact form ujr to To prove that, one uses the fact that the pullback preserves the 

Liouville form. 


CVF’s on P(T*M) induced by base vector fields. 

natural infinitesimal version. 


The results of the previous paragraph have their 


Definition 4.4. Let X G X(M) be a smooth vector field. By fhe contact lift of X we shall undersfand 
fhe confacf vector field Cx on P(T*M) whose flow is P(At), fhe confacf lifl of fhe flow of X. 

The correcfness of fhe above definition is a consequence of a simple observation fhaf fhe confacf liff 
preserves fhe composition of maps, i.e., F{F oG) = F{F) o P(G) for any pair of maps F,G : M ^ M. 
If follows fhaf fhe confacf lift of the flow At is a flow of contact transformations of P(T*M) and as such 
it must correspond to some contact vector field (cf. Definition [3]^. 

An analogous reasoning shows that given a Caratheodory TDVF Xt G X{M) and the related TD flow 
Atr ■ M —)• M, the contact lift of the latter, i.e., F{AtT), will consist of contact transformations and will 
satisfy all the properties of the TD flow. By the results of Proposition |3.3[ F{Atr) is a TD flow associated 
with some contact TDVF (see also Lemma 2.31. Obviously this field is just Cxt ■ The justification of this 
fact is left for the reader. 


Lemma 4.5. The CVF Gx A generated (in the sense of Lemma 3.4) by the NC-class of X, where X is 
any smooth vector field on P(T*M) which projects to X under Tvr, i.e., 

Cx = . 

Proof. Since P(At), the flow of Cx, projects under vr to A t, the flow of X, we conclude that X = 
YTr{Cx)- As we already know from the proof of Lemma 3.4 a CVF is uniquely determined by its class 


in NC. By ( |4.1| ), the NC-class of a field Y G X(F{T*M)) is completely determined by its Tvr-projection. 
In other words, if two fields Y and Y' have the same Tvr-projections, then y — y' is a C- value d vector 
field. T hus t he field X has the same NC-class as the CVF Cx so, by the results of Lemma 
Remark 


3.4 


3.6 1 , it follows Cx = C, 


[X]- 


(see also 

□ 


Remark 4.6. We shall end our considerations about the contact lift Cx by discussing its description in 


terms of generating functions (cf. our comments following Remark 3.6 1 . Let us choose a vector field 
Ron M and fix a contact form uip = {iR)*klM on Ur C F{T*M). Using the results of our previous 
Section and with the help of the contact form cor, the CVF Cx can be presented as Cy for some 
generating function f '■ Ur — M. This function is simply the evaluation of cor at Cx- In fact. 


m) = <^,T7r(X)) = {9,X) 


where 6 = irUO]), i.e. 9{R) = 1. In other words, the value of the generating function of Cx on the class 
[0] equals the value of the representative 9, defined by 9{R) = 1, on the field X which is being lifted. 
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Remark 4.7. It is worth mentioning the following illustrative picture which was pointed to us by Janusz 
Grabowski. Every contact structure on a manifold N can be viewed as a homogeneous symplectic 
structure on some principal GL(1, M)-bundle over N. In the case of the canonical contact structure 
on iV = P(T*M) the corresponding bundle is simply TgM, the cotangent bundle of M with the zero 
section removed, equipped with the natural action of M \ {0} = GL(1,M) being the restriction of 
the multiplication by reals on T*M. The canonical symplectic structure is obviously homogeneous 
with respect to this action. Now every homogeneous symplectic dynamics on TqM reduce to contact 
dynamics on F{T*M). For more information on this approach the reader should consult HGralSIl and 
MBGGISI . 


5 The Pontryagin Maximum Principle 

The Pontryagin Maximum Principle. A control system on a manifold Q is constituted by a family 
of vector fields f : Q x U ^ TQ parametrized by a topological space U. It can be understood as a 
parameter-dependent differential equation 


(CS) 


q{t) = f{q{t),u{t)), u{t)eu 


For a given measurable and locally bounded u{t) G U, the solution q{t) of (^i is usually called a 
trajectory of ( |CS[ ) associated with the control u{t). 

An introduction of a cost function L : Q x [/ —)• M allows to consider the following optimal control 
problem ( |OCP l 


(OCP) 


q{t) = f{q{t),u{t)), u{t)eu 

/ L{q{t),u{t)) dt —)• min . 


The minimization is performed over uf fs which are locally bounded and measurable, the time interval 
[0, T] is fixed and we are considering fixed-end-poinfs boundary condifions q(0) and q{T). By a solution 
of fhe optimal confrol problem we shall undersfand a pair (g(t), u{t)) satisfying ( |OCP| ). 

Fef now q{t) G Q be fhe frajecfory of fhe ( |CS| ) associated wifh a given confrol u{t) G U. If is 
convenienf fo regard fhe related frajecfory q{t) = {q{t), qo{t)) in fhe extended configuration space Q := 


Q X R, where qof) := /g L{q{s), u{s)) d s is fhe cosf of fhe frajecfory af time In facl, q{t) is a 


frajecfory (associated wifh fhe same confrol u{t)) of fhe following extension of (|CS|): 


(CS) 


Q{t) = f{qit),u{t)) u{t)€U, 


wifh f := (/, L • dqg) : Q x U ^ TQ = TQ x TM. Here we freaf bofh / and L as maps from Q x U 
invarianf in fhe M-direcfion in Q = Q x M. In ofher words, we exfended ( |CS| ) by incorporafing fhe 
cosfs qo{t) as additional configurafions of fhe sysfem. The evolufion of fhese addifional configurafions is 
governed by fhe cosf function L. Note fhaf fhe fofal cosf of fhe frajecfory q{t) wifh t G [0, T] is precisely 
qo{T). Since fhe laffer is fully determined by fhe pair (g(f), u{t)), if is nafural fo regard fhe extended pair 
(q(f), u{t)) rafher fhan {q{t),u{t)) as a solution of ( |OCP| . 

Note fhaf fhe exfended configurafion space Q = Q x R 3 q = {q, qo) is equipped wifh fhe canonical 
vecfor field dq^ := (0,dqq) G TQ = TQ x TM. We shall denote fhe disfribufion spanned by fhis field 
by 72- C TQ. The ray := {—rdqg | r G M+} C TZg C TgQ confained in fhis disfribufion will be 
called fhe direction of the decreasing cost at q £ Q. 

^From now on, geometric objects and constructions associated with the extended configuration space Q will be denoted by 
bold symbols, e.g., /, q, Fuq, Ht etc. Normal-font symbols, e.g., /, Ftto , Ht, will denote analogous objects in Q being, in 
general, projections of the corresponding objects from Q. 
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Regarding technical assumptions, following IIPMBG62I . we shall assume that U is a subset of an 
Euclidean space, f{q, u) and L{q, u) are differentiable with respect to the first variable and, moreover, 
f{q,u), L{q,u), ^{q,u) and ^{q,u) are continuous as functions of {q,u). In the light of Theorem 
|A.2| below it is clear that these conditions guarantee that, for any choice of a bounded measurable control 
u{t) and any initial condition q(0), equation ( CS I has a unique (Caratheodory) solution defined in a 
neighborhood of 0. If will be convenient to denote the TDVF’s q i—)■ f{q,u{t)) and q i—;■ f{q,u{t)) 
related with such a control u{t) by and fup)^ respectively. In the language of Sectiontechnical 
assumptions considered above guarantee that and are Caratheodory TDVF’s. In particular 
their TD flows Ftr : Q ^ Q and Ftr ■ Q ^ Q, respectively, are well-defined families of (local) 
diffeomorphismsj^ Note that if q{t) with t G [0, T] is a solution of ( |CS| ), then for every t,T £ [0, T] the 
map Ftr is well-defined in a neighborhood of ^(t). 

In the above setting necessary conditions for the optimality of {q{t),u{t)) are formulated in the 
following Pontryagin Maximum Principle (PMP, in short) 


Theorem 5.1 ( IIPMBG62I ). Let {q{t),u{t)) be a solution of the ( OCP| ). Then for each t £ [0, T] there 
exists a non-zero covector X{t) 
dependent Hamilton equation 


such that the curve Kt = (Q(f))'^(f)) satisfies the tinie- 


(5.1) 


At = HtiA, 


where Hf denotes the Hamiltonian vector field on T*Q corresponding to the time-dependent Hamilto¬ 
nian 


(5.2) Ht{q,X)-={\fu{t){q))- 

Moreover, along At the Hamiltonian Ht satisfies the following Maximum Principle 

(5.3) Ht{q{t),X{t)) = m&x{X{t),fy{q{t))) . 

veu 

Definition 5.2. A pair {q{t),u{t)) satisfying the necessary conditions for optimality provided by Theo¬ 
rem |5T](i.e., the existence of a covector curve satisfying ( |5.1| )-( [531 )) is called an extremal. 


Proof of the PMP. Although the PMP is a commonly-known result, for future purposes it will be 
convenient to sketch its original proof following IIPMBG621I . 

Fet {q{t), u{t)) be a trajectory of ( |CS| ). By Ftr ■ Q ^ Q, where 0 < t < t < T, denote the TD 


flow on Q of the Caratheodory TDVF f^t) defined by the control u{t) (cf. Definition 2.2 1 . In other 
words, given a point q £ Q, the curve 1 1 —)■ Fto{q) is the a trajectory of ( |CS[ ) associated with the control 
u{t) with the initial condition q(0) = q. 

The crucial step in the proof of the PMP is introducing the, so called, needle variations and the 
resulting construction of a family of set^ 


(5.4) 


Kt := cl l^Ftti [fviiqiu)) - fu{ti)iqiti))] Sti 


. i=l 


where 0 < ti < t 2 <■■■< tk < t < T is any finite sequence of regular points (see Appendix of 
the control u{-), Vi are arbitrary elements in U and 5ti are arbitrary non-negative numbers. It is easy to 

^From now on we will use symbols Ftr and Ftr to denote the TD flows of TDVF’s /„(t) and f„(t), respectively, for a 
particular control u{t). Note that Ftr projects to Ftr under tti : Q = Q x R —^ Q. 

"^In the original proof in IPMBG62I the optimal control problem with a free time interval [0, T] is considered. In this case, 
the sets iCt contain additional elements. 
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see that Kt is a closed and convex cone in well-defined for each regular point t G (0, T) of the 

control u{-). What is more, the cones Kt are ordered by the TD flow Ftr, i.e., 

(5.5) TFtAKr) C Kt , 


for each pair of regular points 0 < t < t < T. The above property allows to extend the construction of 
Kt to non-regular points of r2(-) (including the end-point T) by setting 

Kt := U FFtriKr) C T^p)Q . 

{r I T<t and r regular} 

Clearly, these sets preserve all important features of /Cj’s, i.e., they are closed and convex cones satisfying 
( |5.5[ ) for any pair of points 0 < r < f < T. 

The importance of the construction of the cone Kt lies in the fact that it approximates the reachable 
set of the control system ( |CS| ) at the point q{t). In particular, it was proved in IIPMBG62II that if at any 
point t G [0, T], the interior of the cone Kt contains the direction of the decreasing cost then the 

trajectory f i—)■ q(f), f G [0, T], cannot be optimal. 


Lemma 5.3 ( IIPMBG62I 1. If, for any 0 < t < T, the ray lies in the interior of Kt, then {q{t),u{t)) 
cannot be a solution of ( OCP|). 


As a direct corollary, using basic facts about separation of convex sets, one obtains the following 


Proposition 5.4 ( IIPMBG62II ). Assume that {q{t),u(t)) is a solution of ( |OCP[ ). Then for each t G (0, T] 
there exists a hyperplane Tit C separating the convex cone Ktfrom the ray 

Thus one can choose a curve of hyperplane^ Tit C Q separating the cone Kt from the ray 
^q(t) ^ ^ Because of ( |5.5[ ) and the fact that the canonical vector held dqg is invariant 

under TFtr (the control does not depend on the cost), we may choose TLt in such a way that 


(5.6) 


TFtr (nr) = Ut , 


for each 0 < t < t < T. Indeed, the basic idea is to choose any TLt separating Kt from and 

define Tit := TFf^^ {'^Lt) for every t G [0, T]. We leave the reader to check that such a construction has 
the desired properties. 

The geometry of this situation is depicted in Figure]^ 

Remark 5.5. Trajectories of ( |CS| ) satisfying the above necessary conditions for optimality (i.e., the ex¬ 
istence of a curve of separating hyperplanes Fit which satisfies ( |5.6| )) can be classified according fo fhe 
relafive position of fhe hyperplanes Fit and fhe line field 72. C TQ. Note fhaf, since fhe hyperplanes 
Fit evolve according fo fhe TD flow Ftr of the TDVF which leaves the distribution 72. invariant, 
we conclude that whenever "Kqir) C FLr at a particular point r G [0, T], then 'Kqp-^ C Fit for every 
t G [0, T]. We call a trajectory q{t) of ( |CS| ) satisfying the above necessary conditions for optimality: 

• normal if 'Kqp) Fit for any t G [0, T]. Note that, in consequence, the ray 22-^^^^ can be strictly 
separated from the cone Kt for each t G [0, T]. 

• abnormal if 72.q(i) C Fit for each t G [0, T]. 

• strictly abnormal if for some t G [0, T] the ray cannot be strictly separated from the cone 

Kt (and thus 'Kqp) C FLt for each t G [0, T]). 


^By choosing fCo := {0} we can easily extend Ht to the whole interval [0, T\. 
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Figure 2: Geometrically the PMP describes a family of cones Kt along the optimal solution q{t) sep¬ 
arated from the direction of the decreasing cost by hyperplanes 'Ht- Both Kt and Tit evolve 

according to the extremal vector field f^ty 


It is worth emphasizing that being normal or abnormal is not a property of a trajectory itself, but of 
a trajectory together with a particular curve of separating hyperplanes. Thus, a priori, a given trajectory 
q{t) may admit two different curves of separating hyperplanes, one being normal and the other abnormal. 
On the other hand, if q{t) is a strictly abnormal trajectory it must be abnormal (and cannot be normal) for 
any possible choice of the curve of separating hyperplanes. To justify this statement observe that if the 
ray cannot be strictly separated from the cone Kt, then necessarily (since the cones Kt are closed) 
we have —dq^ G Kt for some t G (0, T]. Consequently, also —dgq G Ht, as Ht separates —dq^ G Kt 
and —dgq G '^^(^t) Figure^. Note that since Ht is a linear space, the whole line spanned by 
the vector —dgq is contained irTHt in this case. 



Figure 3: For strictly abnormal trajectories the cone Kt cannot be strictly separated from the direction of 
the decreasing cost for some t G [0, T]. Consequently, Hq(^t) C Ht for each t G [0, T]. 


It is precisely only now when the covector X{t) of the PMP appears. Namely, one can represent each 
hyperplane Ht C as the kernel of a covector \{t) G Due to ( |5.6| ) it is possible to choose 













A contact covariant approach to optimal control 


19 


these covectors in such a way that for every 0 < t < t < T the curve A* = (q(f), A(f)) satisfies 

^t = {F,-yAr. 


This reads as the Hamilton equation ( |5.1| ) in Theorem |5.1[ Finally, the Maximum Principle ( |5.3| ) follows 
from the fact that At can be chosen in such a way that (A(t), —dq^ ) > 0 > (A(f), fc) for any k G K-t- 
The latter inequality for k = fv{(l{t)) — fu{t)(.Q{'t)) regarded for each v £ U implies ( |5.3| ). 

Note that, as we have already observed in Remark [53] for abnormal solutions, we have dq^ G Tit = 
ker A(f), and thus (A(f), dq^) = 0. For normal solutions it is possible to choose A(f) in such a way that 
(A(t), —dqq) = 1 along the optimal solution. 


The contact formulation of the PMP. Expressing the essential geometric information of the PMP (see 
Figure]^ in terms of hyperplanes Ht, instead of covectors A(f), combined with our considerations about 
the canonical contact structure on F{T*M) (see Section]^ allows to formulate the following contact 
version of the PMP 


Theorem 5.6 (the PMP, a contact version). Let {q{t),u{t)) be a solution of the ( |OCP| ). Then for each 
t G [0, T] there exists a hyperplane Tit G P(T*^j^Q) such that the curve 1 1 —)■ Tit satisfies the equation 


(5.7) 




where denotes the contact TDVF on P(T*Q), being the contact lift of the TDVF fnp) on Q (see 

Definition 4.4 and Lemma 4.5). 

Moreover, each Fit separates the convex cone K-t defined by \5.4\ from the ray 


Proof. The family of hyperplanes Fit separating the cone /C* from the ray and satisfying ( |5.6| ) 

was constructed in the course of the proof of Theorem |5.1| sketched in the previous paragraph. To end 
the proof it is enough to check that Fit evolves according to ( |5.7| ). From ( |5.6| ) and Definition 4.2 of the 
contact lift we know that Fit evolves according to P(F.rt)- By the remark following Definition 4.4 this 


is precisely the TD flow induced by the TDVF Cf. 


u(t) • 


□ 


Let us remark that the contact dynamics ( |5.7| ) are valid regardless of the fact whether the considered 
solution is normal or abnormal. We have a unique contact TDVF on P(T*Q) governing the 

dynamics of the separating hyperplanes FLt- The difference between normal and abnormal solutions lies 
in the relative position of the hyperplanes FLt with respect to the canonical vector field —dq^ on Q. 

Remark 5.7. Actually, the fact that the evolution of FLt is contact (and at the same time that the evolution 
of At is Hamiltonian) is in a sense “accidental”. Namely, it is merely a natural contact (Hamiltonian) 
evolution induced on P(T*(5) (on T^Q) by the TD flow on Q defined by means of the extremal vector 
field. In the Hamiltonian case this was, of course, already observed - see, e.g.. Chapter 12 in HAS041 . 
Thus it is perhaps more proper to speak rather about covariant (in terms of hyperplanes) and contravari- 
ant (in terms of covcctors) formulations of the PMP, than about its contact and Hamiltonian versions. It 
may seem that the choice between one of these two approaches is a matter of a personal taste, yet ob¬ 
viously the covariant formulation is closer to the original geometric meaning of the PMP, as it contains 
a direct information about the separating hyperplanes, contrary to the contravariant version where this 
information is translated to the language of covectors (not to forget the non-uniqueness of the choice 
of At). In the next Section we shall show a few applications of the covariant approach to the sub- 
Riemannian geometry. Expressing the optimality in the language of hyperplanes allows to see a direct 
relation between abnormal extremals and special directions in the constraint distribution. It also provides 
an elegant geometric characterization of normal extremals. 
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Although equation ( |5.7| ) has a very clear geometric interpretation it is more convenient to avoid, 
in applications, calculating the contact lift. Combining ( |5.6[ ) with Theorem 2.10 allows to substitute 
equation (|5.7[) by a simple condition involving the Lie bracket. 


Theorem 5.8 (the PMP, a covariant version). Let {q{t),u{t)) be a solution of ( |OCP I. Then for each 
t G [0, T] there exists a hyperplane Tit C such that the curve t i —Tit satisfies the equation 

( |5.6[ ). Equivalently, Tit A a curve of hyperplanes which is a charming distribution that is fi^pyinvariant 
along q{t), i.e., 

(5.8) [fu(t),'Ht]q(t) C Ht fora.e. t G [0,r] . 

Moreover, each ELt separates the convex cone K-t defined by < \5A^ from the ray 

Proof The proof is immediate. The existence of separating hyperplanes ELt satisfying ( |5.6| ) was already 
proved in the course of this section. The only part that needs some attention is the justification of equation 


_ |). It follows directly from the i^tT--invariance along q{t) of ELt and Theorem 2.10 (Note that ELt is 

charming along q{t) by Proposition [2^) 


□ 


Finally, let us discuss the description of the contact dynamics ( |5.7| ) in terms of natural contact forms 
introduced in the proof of Lemma 4.1 Recall that a choice of a vector field R G X(Q) allows fo define 
a nafural embedding of fhe open sef Ur = {[0] | 0{R) 7 ^ 0} C P(T*Q) info T*Q (recall fhaf in fhe 
language of hyperplanes, fhe sef Ur consisfs of fhose hyperplanes in TQ which are fransversal fo fhe 
field R). Whaf is more, fhe pullback ojr of fhe Liouville form LLq on T*Q, is a confacf form on Ur. By 


fhe commenf of Remark 
ojR is simply 

n^*Q)L)UR3{q,[\]) 


4.6 fhe generafing function of fhe CVF associated wifh fhe confacf form 


{\fu{t){q)) - Ht{q,X) G 


where A is a represenfafive of fhe class [A] such fhaf (A, F?) = 1. In ofher words, fhe generafing function 
of fhe confacf dynamics ( |5.7| ) associafed wifh ur is precisely fhe linear Hamilfonian ( |5.2 1 . 

In particular, by choosing R = dq^ we can easily recover fhe resulfs of BOhslSIl . Note fhaf R = dq^ 
is fhe canonical choice of a vecfor field fransversal fo all hyperplanes ELt^ in fhe normal case (note 
fhaf addifionally R = dq^ is Ftr-invarianf). For such a choice of R, fhe corresponding embedding 
^ T*Q is consfrucfed simply by selling (A, dq^'^ = 1, which is jusl fhe slandard normalization 
of fhe normal solution. The associated confacf form is ujr = vr^ d go + where fig is fhe Liouville 

form on T*Q and tti : Q x M —)■ Q and 7 r 2 : Q x M —)■ M are nafural projections. As observed above. 


fhe generafing funcfion of fhe confacf dynamics associated wifh ujr is fhe linear Hamiltonian ( |5.2| ). This 
stays in a perfect agreement with the results of Section 2 in HOhslSlI . 

For the abnormal case there is no canonical choice of the field R fransversal fo fhe separafing planes. 
Yel locally such a choice (buf nol canonical) is possible. The resulling generafing funcfion of fhe confacf 
dynamics (|5.7|) is again fhe linear Hamiltonian (|5.2|). 


6 Applications to the sub-Riemannian geodesic problem 


In this section we shall apply our covariant approach to the PMP (cf. Remark [577] ) to concrete problems 
of optimal control. We shall concentrate our attention on the sub-Riemannian (SR, in short) geodesic 
problem on a manifold Q. Our main idea is to extract, from the geometry of the cone Kt, as much 
information as possible about the separating hyperplane ELt and then use the contact evolution (in the 
form (^1 or (|5.8|)) to determine the actual extremals of the system. 
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A sub-Riemannian geodesic problem. To be more precise we are considering a control system con¬ 
stituted by choosing in the tangent space TQ a smooth constant-rank distribution V C TQ. Clearly 
(locally and non-canonically), by taking f{q,u) = where u = ..., and 

V = (/i,..., fd), we may present V as the image of a map f : Q x U ^ TQ where U = with 
d := rankD, is an Euclidean space, i.e., a control system of type ( |CS| ). We shall refer to it as to the 
SR control system. In agreement with our notation from the previous Sectionwe will write also fu{q) 
instead of /(g, u) G Vq. 


The SR geodesic problem is an optimal control problem of the form ( |OCP[ ) constituted by considering 
a cost function L{q, u) := \g{fu{q)-, fu{q)), where 


g 


VxV 


is a given symmetric positively-defined bi-linear form {SR metric) on T>. 

In the considered situation, after passing to the extended configuration space Q = Q x'R3 {q,qo) 
q, the extended control function f : Q x U ^ TQ = TQ x TM is simply 

f{Q,u) = fu{q) = fu{q) + ^g{fu{q), fu{q))dqo . 


Definition 6.1. By a SR extremal we shall understand a trajectory {q{t),u{t)) of the SR control system 
satisfying the necessary conditions for optimality given by the PMP (in the form provided by Theorem 
5.1 or, equivalently. Theorem |5.61 or Theorem |5.SI). 


The geometry of cones and separating hyperplanes. Observe that the image /(q, U) C TqQ = 
TqQ X TqgM is a paraboloid (see Figure^. The following fact is a simple consequence of ( |5.4| ). 



Figure 4: In the sub-Riemannian case the image /(q, U) is a paraboloid in TgQ. 


Lemma 6.2. Let {q{t),u{t)) be a trajectory of the SR control system and let K-t be the associated 
convex cone defined by formula Then K-t contains the tangent space of the paraboloid f{q{t), U) 

at i-e-, 


(6.1) 


{Y + g{hit),Y)dqq\Y eVqp)] CKt . 
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Proof. It follows from ( |5.4| ) (after taking k = \, ti = t, and thus Fu^ = idg) that Kt contains every 
secant ray of the paraboloid/(q(t), (7) = {f^{q{t)) \ fv{q{t)) G 

passing through the point {Q{t))- Using these secant rays we may approximate every tangent ray of 
the paraboloid U) passing through /a(t)(q(f)) with an arbitrary accuracy. Since K-t is closed, it 

has to contain this tangent ray and, consequently, the whole tangent space of f{q{t), U) at 
(see Figure]^. The fact that this tangent space is described by equality ( |6.1[ ) is an easy exercise. □ 


/ 



Figure 5: Since the cone Kt contains all secant rays M+ • {fv{q) — /ffi(t)(Q)} and is closed, it must 
contain the tangent space T U). 


Remark 6.3. In general, for an arbitrary control system and an arbitrary cost function, the cone Kt 
contains all secant rays of the image f{q{t),U) passing through fu{t){Q{t))- Thus, after passing to 
the limit, the whole tangent cone to f{q{t), U) at fu{t){q{'^)) is contained in Kt- If f{q{t), U) is a 
submanifold, as it is the case in the SR geodesic problem, this tangent cone is simply the tangent space 

Here is an easy corollary from the above lemma and our previous considerations. 


Lemma 6.4. Let {q{t),u{t)) be a SR extremal and let Lit C Fq(^t)Q be a curve of separating hyper¬ 
planes described in Theorem 5.8 Then for each t G [0,T], the hyperplane Tit contains a rankP- 
dimensional linear subspace 


{Y + gUu{tpY)dq,\Y CUt . 

If additionally, {q{t),u{t)) is an abnormal SR extremal, then for each t G [0, T] there exists a 
hyperplane %t <G T^p^Q containing and such that the curve t TLt along q{t) is subject to the 
evolution equation 

(6.2) TEtTfar) = T~Lt , for each 0 < t < t < T. 

Here Ftr denotes the TD flow of a Caratheodory TDVF f^t)- 

Equivalently, TLt curve of hyperplanes containing which is a charming distribution that is 

fu(t)-invariant along q{t), i.e.. 


(6.3) 


lfu(t),'bit]g(t) C Tit fora.e. t G [0,r] 
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Proof. To justify the first part of the assertion, observe that if, in a linear space V, a hyperplane H C V 
supports a cone K. C V which contains a line I C /C (and all these sets contain the zero vector), then 
necessarily I C % (each line containing 0 either intersects the hyperplane or is tangent to it). Since, by 
Lemma 6.2 Kt contains a subspace {Y + g{fmpY)dqQ \ Y G we conclude that this subspace 

must lie in PCt. 

Assume now that the considered extremal is abnormal. In this case, as we already observed in Re¬ 
mark 5.5 PCt contains, in addition to the above-mentioned linear subspace, also the line 'R-qp) and thus 
we conclude that 

® Rqo{t) C 'Rt ■ 

Since {Og} © TZg^ is the kernel of the natural projection Tvri : TQ —)■ TQ, we conclude that, for every 
t G [0,r], the image Tit of PCt under this projection is a hyperplane in Tqp)Q which contains 'P’q[t)- 
Obviously, since projects to fm)’ equation ( |5.6| ) implies ( |6.2| ). By Theorem 2.10 equation (Ol is 


the infinitesimal form of the latter. 


□ 


It turns out that in some cases the above basic information, suffices fo find SR extremals. Let us study 
the following two examples. 


Example 6.5 (Riemannian extremals). In the Riemannian case D = TQ is the full tangent space and g 
is a Riemannian metric on Q. Let us introduce any connection V on Q compatible with the metric. By 
T\/{X, Y) := VxT — VyA — [X, Y] denote the torsion of V (in particular, if we take the Levi-Civita 
connection V = then T^lc = 0). 

In this case rank V = dim Q so a Riemannian extremal cannot be abnormal from purely dimensional 
reasons: by Lemma 6.4 in such a case a (dim Q—1)-dimensional hyperplane Tit C Tqp^Q would contain 


a bigger (dimQ)-dimensional space 'Dgp) = T^q{t)Q< which is impossible. Thus every Riemannian 
extremal must be normal and, by Lemma [64} necessarily 


nt = {Y + g{hit),Y)dqq I y G T,(*)Q} . 

Now any Tif-valued vector field along q{t) fakes fhe form Y{t) + g{fmpY{t))dqq where Y{t) G 
Tqp)Q. Ifs Lie brackef wifh fhe exfremal vector field f^t) = + Igihp), fu(t))dqq is simply (in fhe 

derivations we use the property that Xg{Y^ Z) = giVxY’i Z) + g{Y^ XxZ) for every metric-compatible 
connection) 


fu(t) 2^ 

[/s(t)>^] + {9Xh^t)fu{t),y) + 9ifu{t)Xh(t)Y) - 9{fu{t)XYfu{t))}dqo = 

[/S(t),i"] +5(/s(i), [fuit),Y])dqq + {9Xf,,,,f^t)X) + 9{fmX^{fuit),Y))}dqq . 

By ( |5.8[ ) fhis Lie brackef should be "Ht-valued, and since [fmpY] + 9{fu{t): [fu(t)jY])dqo belongs fo 
nt, we conclude fhaf fhe considered brackef belongs fo PCt if and only if for any Y G Tqp)Q we have 

9Xh(t)fu{t),Y) + 9{fu{t),T\/{fu{t),Y)) = 0 . 

In fhis way we have expressed fhe geodesic equafion for fhe mefric g in ferms of fhe chosen mefric- 
compafible connection V wifh forsion Ty. In case fhaf V = is fhe Levi-Civifa connection, fhe 
torsion vanishes and we recover fhe sfandard geodesic equafion 


LC 

V /2(t)/e(d = 0 . 
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Example 6.6. Consider an abnormal SR extremal {q{t),u{t)) in a particular case of the SR geodesic 
problem where V C TQ is a co-rank one distribution. By Lemma [6^ in such a situation necessarily 
Tit = since the latter space is already of co-dimension one in Now ( |6.3| ) gives us 


[fu{t)i'^q{t)]q{t) C T^q(t) 


for almost every t G [0, T], i.e., in the considered case any abnormal extremal has to be a characteristic 
curve of V. The converse statement is also true. Indeed, the reader may check that in this case 'Ht := 
'^q{t) ® ’^90 p) is the curve of separating hyperplanes containing and satisfying the assertion of 

Theorem |5.8| (see also the proof of Theorem |6.7| ). 

In the following two subsections we shall discuss normal and abnormal SR extremals in full generality. 


6.1 Abnormal SR extremals 

Our previous considerations allow us to give the following characterization of SR abnormal extremals. 


Theorem 6.7. For the SR geodesic problem introduced above the following conditions are equivalent: 

(a) The pair {q{t),u{t)) is an abnormal SR extremal. 

(b) The smallest distribution Ftr-invariant along q{t) and containing i.e., 

F',{T>)qp) = vectR{TFt^(y) | Y G Fgpp 0 < r < T} 


is of rank smaller than dim Q. Here Ftr denotes the TDflow (inQ) of the Caratheodory TDVF 
Moreover, condition depends only on and T> along q{t). 


Note that Theorem |6.7| reduces the problem of finding abnormal SR extremals to the study of the 
minimal distribution FtT--invariant along q{t) and containing T)gpy Often, if q{t) is sufficiently regular, 
this problem can be solved by the methods introduced in Lemma 2.11 which are more practical from the 
computational view-point. 


Corollary 6.8. Let A be a C^-smooth P-valued vector field and lef q{t) wifh t G [0, T] be an integral 
curve of X. Then q{t) is a SR abnormal exfremal in fhe following fwo (non-exhausfive) sifuafions: 


• The disfribufion spanned by fhe iferafed Lie brackefs of X wifh all possible smoofh P-valued 
vector fields, i.e.. 


ad^(p) = (ad^(y) I y G r(p), k = 0 , 1 , 2 ,...) 

is of consfanf rank r along q{t) and r < dim Q. 

• There exisfs a smoofh disfribufion B D V on Q of consfanf co-rank af leasf one, such fhaf 

[X, B]gp^ C Bgp) for any t G [0, T]. 


The above facf follows direcfly from Theorem |6.7[ Lemma 2.11| and Theorem 2.10 In each of fhe 
fwo cases along q{t) we have a consfanf rank smoofh (and fhus charming) disfribufion which confains 
V, is X-invarianf along q{t) (and fhus by Theorem 2.10 also T),--invarianl along q{t)) and of co-rank af 
teas! one. Clearly such a disfribufion musf confain F,{V)qpy which in consequence also is of co-rank af 
teas! one. 
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Remark 6.9. In Sec. 7.3 in ||Zhi951 Zhitomirskii considered the following 2-distribution on 

V = {X = dx,Y = dyi + xdy2 -I- -h hi{x))dy3 + {x{y^f + h2{x))dy^)) , 

where (x, y^) are coordinates on and smooth functions hi and h 2 satisfy the conditions 

f hi (x) = 0 for X < 0 

\hi{x) / 0, h[{x) / 0, /i"(x) / 0 for X > 0 


and 

J h 2 {x) = 0 for X > 0 

\h 2 {x) / 0, h 2 {x) / 0, /i 2 (x) / 0 for X < 0 . 

Zhitomirskii proved that the curve (—e, e) 3 t (f, 0, 0, 0, 0) G (which is obviously an integral 
curve of X) is not an abnormal SR extremal, yet, as he claims, the distribution ad“ {V) regarded in the 
above corollary is of constant rank r = 4 < 5 along this curve. A detailed study of this example reveals, 
however, that along the investigated curve, r = 4 apart from the point (0,0,0,0,0), where the rank drops 
down to 3. Thus the discussed example does not contradicts Corollary |6.8[ as the regularity condition is 
not matched. In fact, the considered curve consists of two pieces of abnormal SR extremals (for f > 0 
and f < 0) which do not concatenate to a single SR abnormal extremal, even though the concatenation 
is C'^-smooth. This example shows that the condition r < dimQ in Corollary 6.8 is not sufficient 
(although it is necessary in the smooth case). 


Proof of Theorem [6^ If {q{t),u{t)) is an abnormal extremal then, by the results of Lemma 6.4 Tit, 
the TQ-projection of the curve of supporting hyperplanes Tit C T^p-^Q, is a curve of hyperplanes 
along q{t) (i.e., a distribution of co-rank one along q{t)), it contains 'Dqp) nnd is T),--invariant along 
q{t). In particular, it must contain the smallest distribution F*,--invariant along q{t) and containing V (cf. 
Proposition |2^. Thus rankF,(P)qp) < rank 7ft = dimQ — 1. 

Conversely, assume that rank F, {'D)q(t) < dim Q. Now by adding (if necessary) to F, {'D)qp^ several 
vector fields of the form Fttq (26) where X G Tq^Q^Q, we can extend F,{'D)q{£^ to 7ft, a co-rank one 
distribution Ftr-invariant along q{t). Define now the curve of hyperplanes 7ft := 7ft © 72.^^ it) C 
Tq(t)Q- We claim that 7ft is a curve of supporting hyperplanes described in the assertion of Theorem 
5.6 Indeed, the Ft,--invariance of 7ft should be clear, as on the product Q = Q x M the TD flow 
Fir takes the form Ftr{q,qo) = {FtriQ), Btr{qo))^ for some TD flow Btr on M. Clearly, since 7ft is 
FtT--invariant along q{t), the tangent map of Ftr preserves 7ft = 7ft © Tlq^py To prove that 7ft indeed 
separates the cone /Ct from the direction of the decreasing cost 72-”^^^ observe that any vector of the form 
fviQit)) - where G lies in Fg(t) ©7^,„ (t) C Tft- Moreover, any vector of the form 

TFt^ [fv{q{T)) - fu{t){(l{r))], where G Fg(^), lies in TFt^(Fg(^) © 7^,^(^)) C TFt^(7fr) C 7ft. 
Thus, the whole cone /Ct is contained in 7ft (cf. formula ( |5.4| )). Since also C © 7ft, we 

conclude that indeed 7ft separates /Ct from (in a trivial way). 

Finally, to justify the last statement of the assertion we can use Theorem 2.10 to express the Ftr- 
invariance o f := along q{t) as the /^^(t)"invariance of the latter distribution, and then 

use Lemma 


2.13 


(for Bqp) D Fg(t) 9 /n(t)(7(f))) to prove that this invariance depends on f^y and 
F, (F)q(t) along q{t) only. Now it is enough to check that F, {V)qp-j itself does not depend on a particular 
choice of the extension of /s(t)((7(f)) to a neighborhood of q{t). Assume thus that f^/p^ is another 
extension of /s:p)(g(f)), that Fir is the related TD flow, and that Fl{'D)qp'^ is the minimal distribution 
Fir -invariant along q{t) and containing Fqpy Now repeating the reasoning from the proof of Lemma 
2.13 we would get 

[fup), F,{V)]qp) = [fii,pyF,{V)]qp^ mod Vqp) . 
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Since [fu(t), F,{V)] a) C F,{V)qp) C F,{V)qpp we get [/n'(t),C F,{D)q(p 

which, by Theorem 2.10 implies that F,{D) is respected by the TD flow Fir along q{t). From the 
minimality of FK'D)^^^ we conclude that C F,{V)qf^iy Yet, for intertwined /gp) and 

we would get the opposite inclusion in an analogous manner. Thus F,{'D)qpy = F^(P)gp), and so it 
does not depend on the choice of the extension of . This ends the proof. □ 


Examples. 

Example 6.10. Let V C TQ be a smooth rank-2 distribution with the growth vector (2, 3, 4,...). Let 
y, Z be a local basis of sections of V. From the form of the growth vector we conclude that the fields 
Y, Z and [Y, Z\ are linearly independent, while the distribution 

(y,z, [y,z],[y,[y,z]],[z, [y,y]]> 


is of rank 4. Thus the fields [Y, [Y, Y]] and [Y, [Y, Y]] are linearly dependent relative to the distribution 
^Y, Y, [Y, Y]), i.e., there exist smooth functions (/), V’ : Q —)■ K such that 

(6.4) 0[Y, [Y, Y]] + [Y, Y]] = 0 mod (Y, Y, [Y, Y]) . 


We claim that the integral curves of the line bundle ^(/)Y + ?/)Y) C P are SR abnormal extremals (notice 
that (/)Y + y)Y G 2 ? is a characteristic vector field of 2 ? + [ 2 ?, V]). To prove this we shall use the results 
of Corollary |6.8| Indeed, it is easy to check, using (6Ai, that for X = cpY + V'Y the smallest distribution 
adx-invariant and containing T> is simply the 3-distribution (Y, Y, [Y, Y]). This agrees with the results 
of Prop. 11 in llLS95i and Sec. 9 of llZhi951 . 


Example 6.11 (Zelenko). The following example by Igor Zelenko HZelOhll become know to us thanks to 
the lecture of Boris Doubrov. The interested reader may consult also IIAZ061IDZ12I . 

Consider a 5-dimensional manifold M with a 2-dimensional distribution B C TM of type (2, 3, 5). 
That is, locally B is spanned by a pair of vector fields Xi and X 2 such that 


Yi, Y 2 , Y 3 :=[^i,^ 2 ], Y 4 :=[Yi,Y 3 ] and ^ 5 := [^ 2 , 263 ] 

form a local basis of sections of TM. Consider now the bundle Q := P(i3) C P(TM) —)■ M of 
lines in B, being a 6 -dimensional manifold and a P^M-bundle over M. Introduce an affine chart [1 : f] 
corresponding to the line M • {Yi + tX 2 ] on fibers of Q —)• M and define a 2-dimensional distribution 
V := ldt,X\ + tX 2 ) on Q. Our goal is to find abnormal SR extremals for this distribution. We will use 
Corollary |6. 8 1 for this purpose. 

First let us show that the integral curves of dt are abnormal extremals. Indeed, it is easy to see that 
[dt, Xi + tX 2 ] = Y 2 and that [dt, X 2 ] = 0, i.e., the minimal distribution 9f-invariant and containing V 
is precisely (^dt, Yi, Y 2 ). This distribution is of constant rank smaller than 6 = dimQ, so by Corollary 
| 6 . 8 [ indeed, the integral curves of dt are abnormal extremals. 

It is more challenging to find a second family of abnormal extremals of V. Let us look for such a 
family being the integral curves of the field 22 = Yi + tX 2 + Fdt, where F is some, a priori unknown, 
function on Q. To calculate the minimal distribution 22-invariant and containing V it is enough to 
consider iterated Lie brackets ad%(dt). Skipping some simple calculations one can show that the vector 
fields 

dt, 22 , adn (dt) = [H,dt], adj^idt) and ad^jj{dt) 

span a 5-dimensional distribution V on Q. Denote [Yj, Yj] := J2k=i for i,j = 1,..., 5. Then 
the Lie bracket ad\j{dt) belongs to V if and only if 

F = —/i4 + (/i4 — 2/|4)f + ( 2 /I 4 — ■ 
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In such a case, Visa constant rank distribut ion c ontaining V and closed under adni') (i-e., 77-invariant). 

the integral curves of H (for F as above) are abnormal 


Since rank D = 5 < dim Q, by Corollary 
SR extremals related with V. 


6.8 


Example 6.12 (Strongly bracket generating distributions). Recall that a distribution V C TQ is called 
strongly bracket generating (SBG, in short) if for any p ^ Q and any X G T{V) non-vanishing at p we 
have 

Vp -|- [X, V]p = TpQ . 

In the light of Corollary |6. 8 1 it is clear that a SR geodesic problem related with a SBG distribution does 
not admit any abnormal SR extremal. 

In fact, the same conclusion holds for a weaker version of the SBG condition, i.e., it is enough to 
assume that 

Vp + [X, V]p + [X, [X, V]]p + .. . = TpQ 
for any X G r(P) non-vanishing at p. 

Example 6.13 (Submanifold). Assume that 5 C Q is a submanifold of co-dimension at least one and 
such that V\^ <Z T5. Then any ACB curve t i—)• q{t) tangent to V and contained in S is an abnormal 
extremal. Indeed, in this case is obviously a charming distribution F^T-invariant along q{t) which 

contains and is of co-rank at least one in Tgp)Q. Thus 

C Tq(t)5 

and, consequently, F,{V)qp^ is of rank smaller than dimQ. By Theorem 
extremal. 

Example 6.14 (Zhitomirskii). Let D be a 2-distribution on a manifold Q such that V"^ ■.= V + [V, V] is 
of rank 3. In llZhi 9.511 Zhitomirskii introduced the following definition. 

A distribution Z C TQ of co-dimension 2 is called nice with respect to V if 

• Z is involutive 

• for any q e Q we have Vq % Zq 

• rank(77^ n Z) = 2. 

In this case the intersection £ := P n Z is a line distribution. We shall show that the integral curves 
of C are abnormal SR extremals. Indeed, observe that V"^ = V‘^ Z + V and thus 

%:= Z FV"^ = Z + V 


6.7 


q{t) is an abnormal 


is a smooth co-rank-one distribution in Q. Clearly 77 C 77 and, what is more, given any section X G 
r(£) we have [X, 77] C 77. Indeed, take any 77-valued vector field Y. Since Ti = Z + V we can 
decompose if (in a non-unique way) as X = Xi -|- I 2 where Xi G r(Z) and X2 G r(77). Now [X, X] = 
[X, Xi] -|- [X, X2]. Clearly [X, Xi] G r(Z), since X and Xi are Z-valued and Z is involutive. Moreover 
[X,X2] G r( 772 ), as both X and X2 are 77-valued. We conclude that [X, X] = [X, Xi] -f [X, X2] G 
r(z + 772 ) = r(77). 


Now it should be clear that the smallest distribution containing 77 and invariant with respect to to the 
TD flow of X is contained in 77, which is of co-rank one. Thus, by Theorem 6.7 the integral curves of 
X are abnormal SR extremals. 
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6.2 Normal SR extremals 


Observe first that the extremal vector field is normalized by g{fu{t)^ fu(t)) = 1 along every solu¬ 
tion of the SR geodesic problem. Indeed, this follows easily from the standard argument involving the 
Cauchy-Schwartz inequality. From now on we shall thus assume that the extremal vector field is 
normalized in a neighborhood of a considered trajectory q{t). This assumption allows for an elegant 
geometric characterization of normal SR extremals in terms of the distribution 

Kit) ■■={y^^,it)\9iY,hp)) = o} 

consisting of those elements of V which are p-orthogonal to along q{t). Note that is a subdis¬ 
tribution of V along q{t). 

Theorem 6.15 ( IIAOP97I ). Assume that the field is normalized, i.e., gifup)^ fu{t)) = 1- Then, for 
the SR geodesic problem introduced above, the following are equivalent: 

(a) The pair {q{t),u{t)) is a normal SR extremal. 


(b) The velocity fu(t){tl{t)) of class ACB with respect to t, and the smallest distribution Ftr-invariant 
along q{t) and containing T>^^^y i.e.. 


F,{T>-^)q{t) = vectR{TFt^(y) | Y € g(Y, f^t)) = 0, 0 <r<T} 


does not contain fup){Q{'t)) far any t G [0,T]. Here Ffr denotes the TD flow (in Q) of the 
Caratheodory TDVF fap). 


Theorem 3.1 of IIAQP97II contains a formulation of the above result equivalent to ours. 

Again if q{t) is sufficiently regular we can use the method introduced in Lemma 2.11 to check con¬ 
dition Q in the above theorem. The result stated below can be easily derived from Theorem 6.15 using 
similar arguments as in the proof of Corollary |6.8| For the case rank I? = 2 it was proved as Theorem 6 
in ISlll. 


Corollary 6.16. Let X be a C^-smooth P-valued vector field and let q{t) with t G [0, T] be an integral 
curve of X. Then q{t) is a SR normal extremal in the following two (non-exhaustive) situations: 


• The distribution spanned by the iterated Lie brackets of X and all possible smooth 27-valued vector 
fields p-orthogonal to X, i.e., 

ad^(27^) = (ad^(y)|yGr(27), g(X,Y) = 0, A: = 0,1, 2,... > 


is of constant rank r along q{t) and it does not contain X{q{t)) for any t G [0, T]. 


• There exists a smooth distribution B on Q, such that 


[X,B]g^t) C Bgp) , x{q{t)) ^ Bgp) aud vK C ^g(t) for any t G [0,r]. 


Proof of Theorem \6.15\ Assume first that {q{t),u{t)) is a normal SR extremal. Let Fit C Tqp)Qbethe 
related curve of separating hyperplanes given by the PMP Note that, since Fit for each t is a hyperplane 
transversal to the line TZgp'j C it must be of the form 


Fit = {Y + at{Y)dgg\YGTgp)Q}, 


where at : Tg{t)Q 


is a linear map. Using the results of Lemma 


6.4 


we know that aJ 


V, 


<i(t) 


fuit){q{t))-‘g, i-e., at{fu(t){Q{t))) = 1 and 27^(^) c keiat. In particular, fu(t){Q{t)) is transversal to 


kerat D D 


gp)' 
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Now, since fm'j = (here we use the normalization of it is clear that 


Ftriq^qo) 


iFtr{q),qo + 


^)) ■ 


It follows that TFtr [1^ + ar{Y)dqq] = TFtriY) + ar{Y)dqQ, for every f, r G [0, T] and Y G Tq(^^^Q. 
Since TFtr (FLr) = FLt, the above vector must be of the form X + at{X)dqq, where X = TFtr{Y). 
That is, at{TFtr{X)) = ar{X). In particular, f i—)• at is continuous and, moreover, TFtr (kera,-) = 
ker at for every f, r G [0, T]. We conclude that ker at is a distribution along q{t) which is FtT--invariant, 
contains and is transversal to fu{t){q{'t))- Clearly, F,{D-^)qp-^ C ker a* and thus it is also transver¬ 


sal to /s(t)(g(f)). 

To prove that t i—)■ fu^piqit)) is ACB, observe first that = ker a* n Dqp-j admits locally a 
( 7 -orthonormal basis of ACB sections. Indeed, ker at is charming since it is Ft,--invariant (cf. Propo¬ 
sition 2.51. Let now {Ai,..., Xn-i} be a local basis of ACB sections of ker a* along q{t). Choose a 
minimal subset of this basis, say {Ai,..., A^}, such that (Ai, ..., = ker at for every 

f in a relatively compact neighborhood of a given point to £ [Oj ^]- Extend locally the SR metric g to 


a metric g on ker at by taking g\ 


9\ 

?(t) 


V 


liO 


and by setting vectors Ai,..., A^ to be ^-orthonormal 


and ^r-orthogonal to Clearly, this new metric is ACB in the considered neighborhood of to. Now 

we can apply Lemma A.4 to the ACB basis {Ai,..., An-i} and obtain an ACB ^-orthonormal basis 
{Ai,..., As, As+i,..., Yji-i} of ker a*. Clearly, by the construction of the Gram-Schmidt algorithm, 
{As-i-i) • • • j Yn-i} is a g-, and thus also a p-orthonormal basis of (the relative compactness of the 
neighborhood is used to assure that the ^-lengths of sections Aj are separated from zero). 

Now let us choose any ACB section Yn of 'Dqp-j which is transversal to Again using Lemma 
we modify the ACB local basis {Ts+i,..., Yn-i,Yn} of 'Dqp'^ to a p-orthonormal ACB local basis 
{Ys^i,... ,Yn-i,Yn}. Obviously, y„(q(t)) is a (/-normalized vector ^-orthogonal to = (1^^+!,..., 


A.4 


Yn-l), 


thus Yn{q{t)) = ±fu(t){q{t))- Now at{Yn{q{t))) = ±at{fu(t){q{t))) = ±1- And since both at and 
Yn{q{t)) are continuous with respect to t the sign ± must be constant along [0, T]. We conclude that 
t ^ fu{t){q{t)) is ACB alike t ^ Yn{q{t)). 


Conversely, assume that Q holds. The condition that the velocity curve t i—)• fu[t){q{'t)) is of class 
ACB, together with the normalization condition, imply that the velocities are preserved by the the flow 
Ftr up to F,(T>-L)qp)-terms, i.e.. 


(6.5) TFt^ [fu(T){qi'r))] = fu{t){qit)) mod F,{V-^)qp-j for every f,r G [0,r]. 

Indeed, denote q{t) = fu{t){q{t)) = EiLi W/*(9(f)), where {/i, / 2 , ■ • •,/d} is a local basis of 
smooth sections of V and where controls u^{t)’s are, by assumption, ACB with respect to t. In par¬ 
ticular, the derivatives u^{t) are a.e. well-defined, bounded, and measurable with respect to t. We can 
differentiate q{t) to get 


q{t) = 


2 = 1 


dfi{q) 

dq 


<?{*) 


qit) + '^'F{t)fi{q{t)) = 


2=1 


dfu{t){q) 

dq 


Qit) 


q{t) + ^u^{t)fiiqit)) 


2=1 


i.e., the velocity q{t) is a solution of an non-autonomous linear ODE with the right-hand side depending 
measurably on time. Note that the homogeneous part of this equation is precisely equation ( |A.4| ) describ¬ 
ing the tangent map of the flow of f^t)iq) along q{t). By the general theory of linear ODEs, for every 
t, r G [0, T] we have 


q{t) = TFtr[q{T)] + 



■ d 

^u\s)fi{q{s)) 

_ 2=1 


ds . 
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By the normalization condition vector (s) fi{q{s)) is 5 r-perpendiculartog(s) = Yli=i ^*('®)/*(9('5))> 

and hence the integral part of the above equation belongs to This proves ( |6.5| ). 

The crucial step now is to build, along the projected trajectory q{t) £ Q, a splitting = 

^q{t) ® (/e(t)(9(^)))^ where Bqp-^ is a co-rank one distribution along q{t), which is Tj,--invariant along 
q{t) and contains Such a can be constructed by adding, if necessary, to F,(P-*-)gp) several 

vector fields of the form Fio(Xi), where Xj G Tg(o)Q together with Tji(Q)(q(0)) are independent. Clearly, 
in this way we can build Bqp^ which is F*,--invariant along q{t), of co-rank one and contains The 

fact that Bqp'j and /n(t)(q(f)) are linearly independent follows directly from condition (|^. Now we can 
construct the curve of separating hyperplanes 'Ht at q{t) by the formula 

'Ht := (Bqp) ® 0 • dqq) © {fu{t)iQ{t)) + 1 • <9go) ' 

By construction it is clear that Tit is a hyperplane in which contains the tangent space to the 

paraboloid ( |6.1| ) and does not contain the line T^qp) ■ From these properties we conclude that 'Ht sep¬ 
arates strictly the ray T^qp^ from the elements of Kt of the form fviq{t)) — fu{t){Q{'t))- Observe that 
thanks to the normalization of f^) (cf- the first part of this proof) and condition ( |6.5| ) we have 

T^Ftr [{fu{r){q{r)) + 1 • 5go)] = {fu{t){q{t)) + 1 • 5go) “od (Bqp) ®0-dqq), 

and thus 'Ht is TFj,--invariant. We conclude that the hyperplane 'Ht also separates strictly the ray 
= TFtri'H~^^-^) from the elements of Kt of the form TFt^[/^(q(r)) - /s(i)(q(T))]. Conse¬ 
quently, using the fact that 'Ht and F.qp) convex, we can use 'Ht to separate strictly F.qp) 
finite convex combination of the above-mentioned elements of Kt- Since Kt is by definition the closure 
of the set of such finite convex combinations, 'Ht is indeed the separating hyperplane described by the 
PMR □ 


A remark on smoothness of normal SR geodesics. As was proved above normal SR extremals are 
C^-smooth (and even more: their derivatives are ACB maps). It is worth discussing geometric reasons 
for this regularity in a less technical manner than in the proof of Theorem 6.15 Let q{t) be such an 
extremal and let 'Ht be the corresponding curve of supporting hyperplanes. As we know from Lemma 


Fqp) eO-dqqC'Ht and {q{t)) + dq^ £ 'Ht for every t £ [0, T]. 

These two facts are enough to exclude, at least in a heuristic way, the existence of singularities of 



Figure 6: The existence of singularities of comer- or cusp-type implies abnormality or the lack of opti¬ 
mality. 


corner-type and of cusp-type along q{t). Indeed, since 'Ht is Fj,--invariant it must be continuous. Note 
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that by the continuity of Tit, the limit subspaces © 0 • dqg coming from both sides of a given 

point Iq G [0,r] must belong to l-Lto- Now if q{t) had a comer-type singularity at ^o^ these limit 
subspaces would be different and thus they would span together the whole space ©’g(to) © 0 • dq^ (cf. 
Figure!^. In particular, fu{to){Q{to)) + 0 • ^ ^g(to) © 0 • would belong to 'Hto- Yet, since 

/u(to)(^^o)) + dqq G H-tq, this would mean that also the difference of the latter vectors, 0 + dq^, lies in 
'HtQ, which is impossible since q{t) is normal. 

In a similar way one deals with a cusp-type singularity. At a cusp we would have limit vectors 
±fu[to) ( 9 (^ 0 )) + dqg in Hto Figure|^. Now 0 -|- 2dqq, the sum of these two vectors, would belong to 
Hto which contradicts the normality of the extremal. Roughly speaking, the existence of singularities of 
corner-type or cusp-type implies dq^ G Hto, i-®-’ either a trajectory is not an extremal or it is abnormal. 


Examples. 


Example 6.17. (Geodesic equation revisited) Theorem |6.15| provides an alternative way to derive the 
geodesic equation in the Riemannian case (i.e., when V = TQ). Let {q{t),u{t)) be a trajectory of 
the SR control system (we shall assume that f^t) N normalized). Since V = TQ, by the assertion of 
Theorem |6.7| in the Riemannian case there are no abnormal extremals. 

Since = {/s(t)(7(f))}■*“ N of co-rank one, the only distribution of higher rank along q{t) con¬ 
taining is Tqp'jQ which contains also fu[t){(l{t))■ Now, by Theorem 
extremal if and only if F,{V-^)qp-^ = i.e., if 


6.15 


{q{t),u{t)) is a normal 


for every f, r G [0, T]. By the results of Theorem |2 .1 0| this is equivalent to 

]g(t) = '^q{t) ’ 

i-e-, g{[fuit),y], fu{t))q(t) = 0 whenever g{Y, f^t))q(t) = 0. Now for such a Y, after introducing a 
metric-compatible connection as in Example|6.5[ we have 


0 =g{[fu{t),y], fu{t)) = gi^hi^y ~ ^Yfuit) - T\7{fu{t),y), fu{t)) = 

5(V/a(t)Y, fu{t)) - g(^Yfu{t),fu{t)) - g{Tv{fu(t),y), fu(t)) = 

fu(t)g(y,fu(t)) giy fu(t) 2)/n(t)) g^^v^fu{t),y^,fu{t)) • 

Using the fact that g{Y, f^t)) = 0 and that g{fu(t), fu{t)) = 1 we get 


g{y^^ fu(t)fu{t))+Unit), y), fu{t)) = 0 


in agreement with the results of Example |6. 5 1 

Example 6.18 (Heisenberg system). Consider a SR system on 3 {x, y, z) constituted by a 2- 
distribution 

'^{x,y,z) = {y ■= dx - ydz, Z :=dy + xdz) 

and a SR metric such that the fields Y and Z form an orthonormal basis. Such a system is usually called 
the Heisenberg system. It is easy to check that the system in question is strongly bracket generating (cf. 
Example|6.12|) and as such does not admit any abnormal SR extremal. Our goal will thus be to determine 


the normal SR extremals using the results of Theorem 6.15 


Take now any normalized P-valued vector field X = Xt := (p{t)Y -|-V’(f)Y^here (fr = 1. We 


have = (X' = Xt := — 4>{t)Z^ and, by fhe resulfs of Theorem 6.15 fhe integral curve q{t) 


of A is a SR normal exfremal if and only if does nol confain X af any poinf q{t). Clearly 
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distribution being adx-invariant, contains the fields X', [X, X'], [X, [X, X']], etc. Skipping 

some simple calculations one can show that 

[X,X'] = -2d, + AY + BZ , 


where 


A = (j)Y(ip) - 'ipY{(p) + -h V’^) = ('*/’) - (</>) 

( 6 . 6 ) f 

B = cpZiPj) - P;Z{<P) - -Y{(P^ + V'") = cPZi^P) - V^Z((/>) . 

Let us now present vector field [X, X'] as (note that {X, X'} is a basis of sections of V) 

[X, X'] = -2d^ + aX + PX' . 


Then 


[X, [X, X']] = X(a)X + /3[X, X'] + X(/3)X' 


Now clearly {X', [X, X'], [X, [X, X']]) would contain X if and only if X(a) / 0. Thus a necessary 
condition for an integral curve q{t) of X to be a normal SR extremal is that a = const along q{t). 
Note that if X{a) = const, then the integral curves of X will indeed be normal SR extremals, as then 
[X, [X, X']] = /3[X, X'] + X{fj)X' and, consequently, wil l be equal to the 2-dimensional 

distribution (X', [X, X']) which does not contain X (cf. Corollary 6.16). 

By comparing the coefficients of [X, X'] expressed in terms of the bases {Y, Z} and {X, X'} we get 


AY + BZ = aX + 


[3X' = a{(j)Y + ipZ) + j3{'ipY — cpZ) = {acp -t- l3ip)Y + {aip — P4>)Z 


Thus, by ( |6.6| ), 

(pY {ip) — ipY {(p) — a(p = ptip 
(pZ{ip) — ipZ{(p) — aip = —ptip ■ 

Consequently, 

(p^Y {ip) — (pipY {(p) — acp'^ = ptcpip = —(pipZ{ip) + ip‘^Z{(p) + aip"^, 


which, after substituting cpY + ipZ by X, leads to 


X{iP/cP) = 


(px{ip) - ipx{(p) 


a(l + {i’/4>)'^) , 


i.e., the quotient ip/(p satisfies the equation X{x) = a{l + x^), where a is a constant. For a = 0 we 
get X = const (i.e., cp and ip are constant along q{t)), and for a / 0 we get x = arctan(af + 7 ) (i.e., 
(p = cos{at + 7 ) and ip = sin(af + 7 )). This corresponds to the two well-known families of normal 
SR extremals of the Heisenberg system (see Sec. 2 of IILS95II 1. whose projections to the (x, y)-plane are 
straight lines and circles, respectively. 


A Technical results 

Below we present technical results and their proofs used in the course of our considerations in Section]^ 






A contact covariant approach to optimal control 


33 


Measurable maps. We shall start by recalling some basic definitions and results from function theory. 

A map / : M D y —)• M” defined on an open subset 1/ C M is called measurable if the inverse image 
of every open set in is Lebesgue-measurable in V. We call / bounded if the closure of the image 
f{V) is a compact set, and locally bounded if the closure of the image of every compact set is compact. 
A point f G F is called a regular point of / : F —)• if for every open neighborhood O G /(f), we 
have 

Mr'(O)nF') ^ 

hm -—--= 1 . 

diamtyO-s-O ) 

Here the limit is taken over open neighborhoods V' 3 t and p{-) denotes the Lebesgue measure on F. 
By Lebesgue theorem, the set of regular points of a bounded and measurable map / : F —)• is of full 
measure in F. 

A map X : M D [to, fi] —^ K” is called absolutely continuous (AC, in short) if it can be presented in 
a form of an integral 

x{t) = x{to) + / u(s) ds , 

Jto 

for some integrable map v{-). Clearly, an AC map is differentiable at all regular points f of u (and thus, by 
Lebesgue theorem, a.e.) and the derivative of x{t) at such a point is simply v{t). We will be particularly 
interested in AC maps x{t) such that the derivative v{t) is locally bounded. In such a case we shall speak 
about AC maps with bounded derivative (ACB, in short). 


Measurable ODE’s. While speaking about ODE’s in the measurable setting we will need to take care 
of some technical properties of certain functions. In order to simplify the discussion let us introduce the 
following 

Definition A.l. A map F : M” x M — will be called Caratheodory if the assignment (x, t) i— 
F(x, t) is 

(A.l) locally bounded, differentiable with respect to x, and measurable with respect to t 


and if the derivative (x, t) (®) l) is 


(A.2) locally bounded, continuous with respect to x, and measurable with respect to t. 


The notion of a Caratheodory map can be naturally extended to the setting of smooth manifolds, namely 
we shall call a map F : M x M —)■ A” Caratheodory if it is Caratheodory in a (and thus in any) local 
smooth coordinate chart on M and N. Indeed, it is easy to see that this property does not depend on the 
particular choice of a chart (cf. the notion of a Caratheodory section in IIJL141 ). 

Consider now a map G : M” x M —MF and the associated non-autonomous ODE in 


(A.3) 


x{t) = G{x{t),t) 


By a {Caratheodory) solution of ( |A.3[ ) on [to, fi] with the initial condition xq at to we shall understand 
an AC map [to, ti] 9 t i—)> x(t) G which satisfies ( |A.3| ) a.e. (recall that an AC map is differentiable 
a.e.), such that x(to) = xo. Note that speaking about Caratheodory solutions makes sense also if the map 
G is defined only a.e.. 

The following fact is a straightforward generalization, to the measurable context, of the standard 
result about the existence and uniqueness of the solutions of ODE’s. 
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Theorem A.2. Assume that the map (x, t) i—)• G{x, t) is Caratheodory. Then, for each choice of 
(toja^o) G K X there exists, , in a neighborhood of to, a unique (Caratheodory) ACB solution 
1 1 — x{t; to,xo) of equation ( |A.3| ) satisfying x{to-,to, xq) = xq. 

Moreover, x{t; to, xq) is differentiable with respect to xq and the derivative -§^{1', to, xq) is contin¬ 
uous with respect to xq and ACB with respect to t. In fact, the derivative ^^{t', to,xo) is the unique 
(Caratheodory) solution of the following linear time-dependent ODE, called the variational equation, 

dG 

(A.4) V{t,xo) = -^{x{t;to,xo),t)V{t,xo) 

for a curve of linear maps V(t,xo) : —)• with the initial condition V{to,xo) = 

idT^gR"- 

The proof is given in IIBP04I (Theorem 3.3.2). Also Sec. 3 of IIGJllH may be useful. Note that equa¬ 
tion ( |A.4| ) can be obtained by differentiating the equation x{t; to,xo) = G(x{t; to,xo),t) with respect to 
xo and substituting V(t, xq) for to, xq). 


Proof of Theorem |2.10( In this paragraph we will provide a rigorous proof of Theorem |2. 10| We shall 
begin with the following lemma which characterizes the tangent map TAtr of the TD flow of a TDVF 
Xt in terms of the Lie bracket [Xt, •]. Informally speaking, transporting a given vector Zq via the map 
"TAuq along an integral curve of Xt turns out to be the same as solving the equation [Xt, ■] = 0. 

Lemma A.3. Let Xt be a Caratheodory TDVF on a manifold M, x{t) = x(f; to,xo) (with t G [foj 
its integral curve, and Auq its TDflow. Let Zo G TajpM be a tangent vector at xq and denote by Z(x(f)) 
a vector field along x{t) obtained from Zo by the action of the TDflow Auq, i-e., Z{x(t)) := TAttfiZo)- 
Then the assignment 1 1 —)■ Z(x{t)) is ACB and, moreover. 


(A.5) [Xt,Z]^p^ = 0 fora.e. t G 

Conversely, if Z is a vector field along x(f) such that the assignment t i—)• Z(x(f)) is ACB and that 
equation ( A.5| ) holds, then Z(x(t)) = TAuq [Zq), where Zq = Z(x{to))- 

Proof Consider first the vector field Z(x{f)) := TAufiZo) along x(f). The fact that t i—)• Z(x{t)) is 
ACB follows directly from the second part of the assertion of Theorem |A.2[ 

Let s I—)• zo(s) be a curve in M representing Zo, i.e., 20 ( 0 ) = xq and = Zo- It is clear 

that for each f G M the vector Z(x{f)) = TAttfiZo) is represented by the curve s 1 —>■ x(f; to, -zo('S)) = 
Aufizo{s)). Now from ( |2.3| ) we have 

[Xt,Z]^f^t) “=—Z(x(f)) - — ^_^Xt(x(t-,to,zo{s))) = 


dt 

dt ds 
d f dx 


, ... 9 

x{t;to,zo{s)) - — 
s=o ds 


s=0 


Xt(x(t-,to,zo{s))) = 




Xt{x{t,to,zo{s))) = 




s=0 


s=0 

At(x(f;fo, 2 o(s))) = 


s=0 


Xt(x{t;to, zo{s))) . 


Passing to local coordinates in which Xfix) writes as G{x, t) and using the fact that to, xq) satis¬ 
fies (|A.4|), we easily get 


dG dx d 

[Xt,Z]^p) “=■ —(x(f;fo,xo),f)^(f;fo,xo)^o - ^ ^_^G{x(t;to,zo{s)),t) = 


dGf dx dG dx d 

— {x{t-,to,xo),t)—^(t,to,xo)Zo - -^(x(f;fo, 2 ^ 0 ( 0 )),t)^(f;fo, 2 o( 0 ))— ^2;o(s) = 0 , 
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as 2 ;o( 0 ) = xo and £|o^;o(s) = Zq. 

To prove the opposite implication let now t i—)• Z{x{t)) be an ACB vector field along x{t) which 
commutes with Xt. Let us choose a family of curves s i—)• z{t, s) representing vectors Z{x{t)) for each 
t, that is z{t, 0) = x{t) and s) = Z(x(t)). Since [Xt, Z]^(^t) = 0> we have by p.3| ) 

After introducing local coordinates as above we have 

f) f) f)n f) r)r^ 

^Z(x(f)) £ ^^G{z{t,s),t) = ^(^(t,0),f)£ ^^^z{t,s) = ^{x{t),t)Z{x{t)) . 


As we see 1 1 —)• Z{x{t)) satisfies fhe linear ODE 
(A.6) 




Since, by fhe firsl pari of Ibis proof, for fhe vector field Z{x{t)) := TAu^ [Z{x{to))] we also have 
[Xt, Z]^^t) = 0 a.e. along x{t), we. conclude fhal t i—)• Z{x{t)) is also subjecf to a linear ODE of fhe 
form ( |A.6[ ). Thus fhe difference Z{x{t)) — Z{x{t)) is a Carafheodory solulion of fhe linear ODE ( A.6 l 
wilh fhe inifial value Z(x(io) — Z{x{tQ)) = 0. Using fhe uniqueness of fhe solufion (cf. Theorem 
we conclude fhal Z{x{t)) — Z(x(t)) = 0. 


A.2i 


□ 


Now we are finally ready to prove Theorem 2.10 


Proof of Theorem 2.10\ Assume firsl fhal condition ([^ of Theorem |2 .1 0| holds . Choose a basis {^lo, ■ ■ •, Z^q 
of Bx{tQ), where k is fhe rank of B, and for i = I... ,k denote Zi{x{f)) := TAu^iZnf). By fhe resulls 
fhe fields Zi are ACB along x{t) and satisfy [Xt, = 0 a.e. along x{t). Thanks to 


A. 3 


of Lemma 

condition (|^ and fhe facf fhal Auq is a local diffeomorphism, Zj’s span B. 

Lei now Z ^ Tacb{B) he any ACB section of B. We wanf to presenl if as a linear combination of 
fields Zi wifh ACB coefficienfs, i.e., Z = (fZi, where cf are ACB functions along x{t). To prove 
fhal such a presenlalion is possible first take vectors Wjo G Tj,(^to)M with j = 0,..., s such that Zio’s 
together with Wjq’s form a basis of Clearly, the fields Wj{x{t)) := TAftu(lUjo) fogelher 

wilh Zfs span TM along x{t). Since by Lemma A.3 fhese fields are ACB, given any local basis of 
smoofh vector fields U := {Ui,..., Uk+s} on AI, fhe Iransifion mafrix Tu^zw from fhe basis U to fhe 
basis ZW := {Z\,..., Zk, Wi,..., Wg] is a mafrix of ACB functions. As Tu^zw is non-degenerate, 
fhe inverse mafrix Tzw^u is also a mafrix of ACB funcfions (here we use fhe facl fhal if f is an ACB 
function separaled from 0, fhen so is ^). Thus any veclor field wilh ACB coefficienfs in basis U (in 
particular Z) will have ACB coefficienl in basis ZW. As fhe field Z is (^-valued, all Wj’s coefficienfs of 
Z vanish, i.e., Z = Yli <fZi, where (f are ACB functions along x{t) as intended. Now by fhe Leibniz 
rule|^we gel 


[Ai, Z],p) = [Xt, Y, 4>^Z,],p) = Y inxt, Zi],p) + Xt{4>^)Zi ] “A- J] Xtif^)Z, 


x{t) 


£B 


x(t) ■ 


Thus @ implies @. 

Assume now fhal condition (|^ of Theorem 2.10 holds. Lei {Zi,..., be any basis of ACB 
sections of B. The idea is to modify fhis basis to anolher basis of ACB sections {Zi,..., Zk}, such fhal 
for every i = 1,..., A: we have \Yt, Zi]^^t) = 0 a.e. along x{f). In fhe lighl of Lemma 
imply fhal fhe new basis is respected by fhe flow Att^ and, consequenlly, fhal @ holds. 


A.3 


Ibis would 


^’We leave the proof of the fact that the Lie bracket (|2.3|l satisfies the Leibniz rule 


as an exercise. 
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Due to Q, [It, is a ^-valued locally bounded measurable vector field for each i = 1,... ,k 

and thus there exists ak x k matrix of locally bounded measurable function^ along x{t) such that 


[Xt, ^i]a;(t) — 4’i Zj 


x{t) 


Now the simple idea is to look for the desired basis {Zi,..., Z^} in the form Zi = Ylj 'i’i Xj, where 
is an invertible k x k matrix of function ACB along x{t). Clearly for such Zj’s we have 

[At, Z4(t) =[Xt, to + E ^Mt) =■ 

X\t) 


x(t) 


+EEA'f/z. 

j j ^ 

As we see [Xt, ^j]a;(t) = 0 a.e. along x{t) if and only 

Xt{ip/) =■ > 

s 

i.e., the matrix 'll;/ should be a solution of a linear ODE with locally bounded measurable coefficients. 
Due to the results of Theorem A. 2 for a given initial condition, say, '^/(xo) = S/, this equation 
has a unique local ACB solution. As a consequence, we prove the local existence of the desired ba¬ 
sis {Zi, ..., Zk}, which implies □ 


Proof of Lemma r2.13( We shall end our considerations by providing the following 

Proof of Lemma The idea of the proof is very simple. Consider another D-valued Caratheodory 

TDVF X[ such that Xt = X[ along x{t). We shall show that B is X^-invariant if and only if it is X[- 
invariant along x{t). The justification of this statement is just a matter of a calculation. Observe that 
since Xt and X't are both Caratheodory and D-valued, then so is their difference Xt — Xf Given any 
local basis of smooth vector fields {Wi,..., VFs} of V we may locally represent Xt — X't as 

Xt{x) - X'tix) = ^ f{t, x)Wi{x) , 


where x) are Caratheodory functions (in the sense of Definition A.l I. Since X't = Xt along x{t) 
and Wi ’s form a basis of V, we have 


(A.7) 


(f{t,x{f)) = 0 . 


Now for any section Z G Tacb/S), using the same notation as in formula p.3|), we have 


[x't,zU)-[Xt,zU)^ 


ds 


s=0 


Xt{z{t,s)) - 


d 


X't{zit,s)) = 


d 


d 


ds 

E 

i 

E 


s=0 

A 

'Fs 

A 

Fs 


[Xt{z{t, s)) - X't{z{t, s))] = — 


s=0 


ds ls=o 

'^(ff{t,z{t,s))Wi{z{t,s)) 
, i 

d 


(pft, z{t, s))Wi{x{f))) + (f{t, x{t))— Wi{z{t, s)) 
s=0 ds s=0 


s=0 


(lf{t,z{t,s))Wi{x{t)) . 


’The existence of measurable functions (l>^ can be justified in a similar manner to the existence of ACB functions ij>i above. 
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Clearly the above expression is T^x(t) C -valued. Thus 

[Xt-,Z]x{t) = [X[,Z]x(t) 

and hence, since Z was an arbitrary ACB section of B, 

[XuB]xp) = [X',,B]xp) 


along x{t) 
mod Bxit) , 

mod Bx{t) ■ 


This ends the proof. 


□ 


A technical result about charming distributions. The following result will be needed in the course 
of Subsection 6.2 to prove that normal SR extremals are of class C^. It states that the Gram-Schmidt 
orthogonalization algorithm works well on charming distributions. 


Lemma A.4. Let B C TM be a charming distribution along a curve x : [to, fi] —^ M- Assume that B 
is equipped with a positively-defined scalar product g : B Xx(-) B ^ R such that that the assignment 
t I—7- g{x{t)) is an ACB map. Let {Xi,... ,Xs} be a family of s linearly-independent ACB sections of 
B along x{t). Then the Gram-Schmidt orthogonalization algorithm applied to {Xi ,..., produces 
a g-orthonormal family of ACB sections ofB along x{t). 

Proof Recall that the Gram-Schmidt algorithm maps a set {Xi,...,X 5 } into a ^-orthonormal set 
{X", ..., X"} constructed in the following way 


Xi^X{ :=Xi, 

X2 I—)• X2 := X2 — projjYi "^2) 


X,^X' 

X'^X" 


s-1 




2 = 1 




where proj jjf Y := j(^^X denotes the p-orthogonal projection of Y on the space spanned by X. 
Now it is enough to use the following elementary facts concerning ACB functions: 


• A sum and a difference of two ACB functions is an ACB function. 


• A product of two ACB functions is an ACB function. 

• If an ACB function f '■ [to, G] —)• M is separated from zero, then ^ is an ACB function on [foj G]- 

Clearly, in every step of the Gram-Schmidt algorithm we apply one or more of these elementary oper¬ 
ations to ACB sections (we use the fact that g is ACB and that for every ACB non-zero vector X the 
values of g{X,X) are separated from zero on Thus as a result we also obtain ACB sections 

Xf’s. □ 
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